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ESTIMATING A MIXED-EXPONENTIAL RESPONSE LAW* 


F. J. ANSCOMBE 
Princeton University and Mathematicat 


This paper is concerned with the estimation of the parameters of a 
mixed-exponential response law, given some sample observations. Two 
alternative laws are considered, each having two parameters. The first 
law expresses the proportion F(t) of the population who have responded 
within time ¢ in the form 

F(t) = (1 ~— e™é. 
The second law states that 
F(t) =1-—-(1 +)”. 


The problem has been encountered in the context of market research, 
in estimating the “penetration” of a market by a new product; and it is 
therefore described in those terms. But mathematically identical prob- 
lems arise also in life testing and in fisheries research. The problem is 
similar to the well-known one in toxicology of fitting a response law 
(logistic, probit, or whatever) to dose-mortality data, with the exception 
that in the present case the points on the empirical response curve are 
not independent, the basic sampling distribution being multinomial 
instead of independent-binomial. In regard to statistical methodology, 
the paper aims to illustrate the role of the likelihood function in estima- 
tion. Whether the likelihood function has a “large-sample” appearance 
depends not only on the sample size but on the parametrization. 


1, MARKET PENETRATION 


MATTER of great interest to the manufacturers of groceries and other 
A products intended for household consumption is to forecast the sales that 
will later be achieved by a new product from information about sales made dur- 
ing a relatively short period following its first introduction into the market. 
The information about sales is derived from consumer panels, consisting of a 
sample of households (i.e. housewives) who report their weekly purchases of all 
products of specified types. 

Fourt and Woodlock [5] have suggested that the analysis of sales data should 
be performed in two parts. First the dates of first purchase of the new product 
by a household should be examined. Then the characteristics of repeat buying, 
its frequency and quantity, should be studied. First purchases are no doubt 
greatly affected by advertising, priceconcessions, and personal recommendation, 
whereas subsequent purchases will be influenced by the consumer’s experience 





* Read at the Annual General Meeting of the Institute of Mathematical Statistics, Stanford University, 
August 25, 1960. 
t Subsidiary of Market Research Corporation of America. 
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of the product. We may therefore expect first purchases and repeat purchases 
to differ in character. 

In this paper we are concerned only with first purchases, with estimating 
what is called the curve of market penetration of the new product. We wish to 
estimate the function F(t), which is the proportion of the whole population of 
households who have purchased the new product at least once within time ¢ 
after its introduction to the market. The data consist of records from a panel of 
n households extending over a period of duration T from the time of introduc- 
tion of the new product (which we shall regard as the zero point of the time 
scale). Let m be the number of panel households which purchased the new pro- 
duct, the times at which they first did so being denoted by h, ft, - - - , t»—these 
times may be listed in any order, for example in increasing order. For any use- 
ful information to be obtained, m must not be too small; usually but not nec- 
essarily, it is much less than n. Let / denote the average of the observed pur- 
chase times, i.e. 

mi = Lit; } ,»2,°°°*,m). (1) 


The empirical peneiration curve is defined for ¢ in (0, T) by the non-decreasing 
function 
F(t) = E.2 {the number of the (¢,) that lie in (0, ¢) }. (2) 
n 
It is found that this curve, after smoothing, tends to be concave from below. 
Thus the rate of penetration tends to decrease. 

As a first approximation in formulating a theory, we make some sweeping 
assumptions. We suppose the population of households to be fixed, and we re- 
gard the panel as a random sample from it. We ignore seasonal fluctuations in 
demand (such fluctuations can be allowed for roughly by modifying the time 
scale). We suppose the new product became instantly available at time ¢=0, 
and that the whole economic situation, advertising, promotion, and competition 
have remained constant thereafter. Under these conditions, the simplest as- 
sumption that could be made regarding first purchases of the new product 
would be that all members of the population were equally liable at all times to 
buy the product, if they had not already done so. We should then have an ex- 
ponential penetration law of the type 


F(t) = 1 — ee (3) 


This, however, does not fit the data. Fourt and Woodlock have suggested 
that only a proportion, say £, of the population is susceptible, and that for 
these households the exponential law holds; whereas the rest of the population 
is totally immune and will never buy the product. We then have a law of the 
type 


Fi) = (1 — e*, (4) 


where a and é are parameters, with a>0 and 0<£<1. Let us consider how to 
fit this law to sample data. 


2. FITTING THE RESPONSE LAW 


The density function for purchase times is 





MIXED-EXPONENTIAL RESPONSE LAW 


f(t) = dF(t)/dt = tae. 
Hence the likelihood function, given the sample observations, is 
L = [1 — F(T))"-"M f(t) 
(ga)" exp (—mat)(1 — & + ge*7)"—™. (6) 


On inserting the observed values of n, m, T and /, we have a function of § and a 
only which measures the relative plausibility of the observations for different 
parameter points (a, £). It is therefore interesting to examine the behavior of 
this function. Clearly L=0 for a=0 and for §=0, and if for a moment we do 
not impose the restriction that §<1, then L=0 also for £&=(1—e7*")—. (We 
assume here that m and n—™m are both positive.) Inside the region of the (a, &)- 
plane bordered by these three curves, i.e. in the region a>0, 0<&<(1—e7@7)-1, 
we have L>0. However, we shall regard values of & greater than 1 as inadmis- 
sible. It is to be noted that, n and 7’ being given, the likelihood function (6) 
depends on the observations only through the two quantities m and /, which 
are thus sufficient statistics. 

To find the point (@, ¢) where L attains its maximum we equate to zero the 
partial derivatives of In L with respect to — and a. The first of these gives 


m/n 


iia 1 — exp (-—&T) 





(7) 


On substituting (7) into the derivative with respect to a we obtain after a little 
reduction 

i 1 1 

pot Seen e-em (8) 
7 aT exp (aT) — 1 


Equation (8) is solved for & very easily if a table of the function a~!—(e*—1)~! 
is first prepared. Deemer and Votaw [3] have given such a table. Then é is 
found from (7). Equation (8) is soluble and yields a finite positive @ provided 
that 0<i/T <3. If it happens that // 7 > 4, which implies that the smoothed em- 
pirical penetration curve is not concave from below, we find that L approaches 
its upper bound when a—0 with at = m/nT. However, if the law (4) is anywhere 
near true and the observations are fairly extensive, this contingency will not 
arise. It can happen that the é given by (7) exceeds 1. 

Having found (@, £) and the corresponding value of L, which we shall call 
Limax, We can proceed to calculate contours on which L has fixed values less than 
Lmax, and so obtain a graphical representation of the behavior of L. If the in- 
formation provided by the sample is great enough, we shall expect L to be 
roughly proportional to a bivariate normal density function and the contours 
to be elliptical in shape. That being so, it is easy to show that if a and é are 
assigned independent uniform prior distributions, then the posterior probabil- 
ity that the true parameter point (a, &) lies outside one of these contours is just 
the value of L/Lmax on the contour. Thus the contour on which L/Lmax=1/10 
(say) can be regarded as enclosing a 90 per cent credibility region for (a, &), 
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in the absence of any strong prior opinion—provided it does not pass outside 
the admissible region where é <1. 


3. GROUPED OBSERVATIONS 


We have supposed above that the times (¢;) at which the m panel members 
who purchased the product made their first purchase were recorded exactly. 
In practice the times are rounded off, usually to a week, and when panel records 
are being collated summaries may be made at monthly or three-monthly inter- 
vals. Thus the available data consist of ordinates of the empirical penetration 
curve at certain discrete times, usually equally spaced, and not the whole curve 
observed continuously up to time 7’. Let there be k observation times, 7;, T2, 

++, Tea, Te=T; and let ri, ro, - - +, te be the numbers of panel members 
who made their first purchase of the product during the time intervals (0, 7), 
(T;, T2), «++, (Tea, T). We have ri+rot+ - +--+ +re=m. If k is large and the 
time intervals are all short, we may use the preceding formulas, supposing that 
the r; purchases in the interval (7';., 7;) were made at the time (7j;.+7')/2. 
If k is small a more exact treatment will be preferred. It turns out that we still 
have simple sufficient statistics and the preceding formulas are easily modified, 
provided that the observation times are equally spaced and 7;=j7T/k (j=1, 2, 

--+,k). It is convenient to introduce a new parameter p to replace a, 


p = 1 — exp (—aT/k). (9) 
Then in place of (6) we have for the likelihood function 
L = {1 — F(T)|"-0,[F(7,) — F(T;-1)]" 
= (e)"(1 — pL — & + ECL — pt, 
where 
8=f2+ 2st --- + (k— I)r%; 
and in place of (7) and (8) we have 
m/n 


i (12) 
— (1 — py 


—p k(1 — p)* (13) 
F 1— (1 —)* 


Equation (13) can be solved for # provided 0<s/m<(k—1)/2, and then 
(12) yields £& ZL vanishes when =0, when p=0, when p=1 and when 
t= [1—(1—p)*]_". 

4. A NUMERICAL EXAMPLE 


To illustrate the application of the above formulas, some fictitious data were 
compiled from a table of random numbers, to represent observations at 5 
(=k) equally spaced times on a sample of 1000 (=n) households following the 
law (4) with parameters £=0.3 and p=0.1. Each household in the sample has 
chances equal to 0.03, 0.027, 0.0243, 0.02187, 0.019683 and 0.877147 of making 
a first purchase of the new product respectively during each of the five cbserva- 
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tion periods or not at all during the time of observation. On sampling this mul- 
tinomial distribution, the following observations were obtained: 


r, = 33, re = 26, rs = 24, rg = 26, r5 = 13, n — m = 878. 
From them we calculate the sufficient statistics 
m/n = 0.122, s = 204; 


the likelihood function is 


L = (&)'*(1 — p)*[1 — & + &(1 — 1) 5), 


and we find 
6 = 0.15285, & = 0.21643. 


This maximum point of L is shown in Figure 1. The boundary of the figure is 
the contour for L=0, and the two interior contours are for L/Imax=1/10 (the 
inner contour) and 1/1000 (the outer one). It will be seen that the contours are 
far from elliptical in shape. Moreover, even the contour for L/Dmax=1/10 in- 
cludes parameter points having a very wide range for both é and p. The sample 
has given us only a vague idea of the parameter values, and is conspicuously a 
“small,” not a “large,” sample. The parameter p (or a) must be estimated from 
the curvature of the empirical penetration curve, and in this case there is little 
curvature. In fact, the example is somewhat untypical of the samples used in 
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Fia. 1. Likelihood contours for the numerical example—original parameters. 
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practice for predicting market penetration, being smaller in size and in duration 
of observation. Nevertheless it is worth considering a little further. 

What is comparatively well determined by the sample is F(7’), the penetration 
at the end of the observation time. The likelihood function behaves in a more 
“large sample” fashion if we take F(7’) as one of the parameters. Let us relabel 
it n, thus 


n = [1 — (1 — p)5], 


and express L in terms of p and 7, thus 


L= }'22(] ime n) 878} 


oe oat 
Uf — (1 — p)5}}22, 


This is a function of 7 multiplied by a function of p, and insofar as the contours 
of L are ellipses their axes must be parallel to the coordinate axes. The previ- 
ously calculated contours are shown redrawn for the parameters a and 7 in 
Figure 2, where we have supposed the time scale chosen so that 7 =k=5 and 
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Fia. 2. Likelihood contours for the numerical example, after change of parameters. 


p=1—e-*. (The contour for L=0, mostly not shown in the figure, consists of 
the lines »=0 and n=1.) The ellipticity is improved by taking a rather than p 
as abscissa. 

5. REPARAMETRIZATION OF THE RESPONSE LAW 


In general, we obtain what Jeffreys [9] has called orthogonal parameters if 
we express £ in terms of n defined by 


n = &(1 — e-*7) = e{1 — (1 — p)*]. (14) 


Then L becomes a function of 7 multiplied by a function of a or p. We can ap- 
proximate fairly well to the likelihood contours, if we wish, by evaluating the 
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second derivatives at the maximum and assuming that In L is quadratic. The 


second derivatives are: 
E ln =| n (15) 
= — ————_ ; 15 
On? Jy (1 — #) 
and for continuous observations 
mTé(E — 4) 
re jo ones 
a Re 
or for grouped observations 


in L mT?(1—p) mT&(é— 4 
ee ek is 
Oa A kp? q? 


The difference between (16) and (17) represents the loss of information on a 
due to grouping. For our numerical example, (15) is evaluated at —9336 and 
(17) at —236, and an approximation to the contour for L/Lmsax=1/10 is the 
ellipse 


4{9336(n — 0.122)? + 236(a — 0.166)?} = In 10 = 2.3026, 


which is very close to the true contour. If we substitute our estimates into 
(16) we obtain —246, and so we estimate that the loss of information on a 
caused by the grouping of the observations has been 10/246 or about 4 per 
cent. There is no loss of information on 7. 

The parametrization in terms of » depends on the observations—specifically, 
on the duration 7’. It is the most convenient parametrization for evaluating 
the precision of any estimate. But in other respects it is not particularly con- 
venient or “natural.” The main purpose of estimating the parameters of the 
response law is to predict future penetration, i.e. to infer F(t) for > T7—in par- 
ticular, perhaps, to estimate the limiting penetration for large ¢, which is just &. 
The original parameters £ and a are the most meaningful and intelligible. 


6. AN ALTERNATIVE RESPONSE LAW 


We have considered in detail how to fit the penetration law (4), which is the 
simplest that conforms at all with experience. Fourt and Woodlock [5] have 
found that predictions made by fitting (4) to the early part of a long observa- 
tion sequence tend to disagree with later observations by being too low. The 
empirical penetration curve does not flatten off as quickly as (4) indicates it 
should. 

Now the assumption that the whole population of households falls into two 
groups, all members of one group being equally susceptible to trying the new 
product, and all members of the other group totally immune, is not too plau- 
sible. Without abandoning the basic assumption of an exponential probability 
distribution for the first purchase time for any individual household, we might 
naturally suppose that the rate coefficient a had a distribution of values over 
the population, which could be indicated by a probability weighting dW(a), so 
that the penetration law for the whole population took the form 
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F(t) = fra — e*)dW (a). (18) 


We obtain (4) if the weight distribution is discrete, with positive weights only 
for one positive value of a and also for 0. Another possibility that may be con- 
sidered is a continuous gamma (Pearson Type III) distribution, 


dW(a) = a’ 'e-*\da/T(v)X’, (19) 


with two positive parameters \ and v. Then (18) gives 


r 
f® = 


: (20) 
~ a+) (1 + dt) 


F(t) = 1 

As t increases, F(t) approaches 1, but does so very slowly if » is small, say less 

than }. It then happens that the early part of the curve (20) resembles (4) in 

appearance, but departs from it for larger ¢ in the way that has been observed. 

Given a continuous observation record over (0, 7), the likelihood function 
for (20) is 


L = (av)™(1 + AT)—O-™ TE (Lt. (21) 


If m is substantial, this is far more troublesome than (6) to compute, because 
of the absence of simple sufficient statistics. Similarly, the likelihood function 
obtained from grouped data is more troublesome than (10). 

It is easy enough to fit (20) “inefficiently” but quite well. One can choose 
\ and » so that (20) passes through two assigned points on the empirical pene- 
tration curve, for example, through the end point and the point whose ordinate 
is a half that of the end point. One may then examine the goodness of fit. 

But since the likelihood function contains theoretically all the information 
available concerning the parameters, it is interesting to consider what can be 
done with (21). It is not impossible to compute In L and its first and second 
derivatives in a straightforward manner, especially if an automatic computer 
is available. The observed times should be rounded sufficiently for the number 
of different times recorded to be manageable (say less than 50). It would seem 
to be impossible to express \ and » explicitly in terms of exactly orthogonal 
parameters, but the device that yielded an orthogonal parametrization of the 
law (4) greatly improves the orthogonality in the present case. If we define » by 


uw=vin(l1+AT), (22) 


and express L in terms of \ and uy, eliminating v, we find that d°ln L/d\du is 
very small relative to the other two second derivatives, when d is large. The 
reparametrization does not greatly worsen the computation of In L and its 
derivatives, and may be considered worthwhile, because we may hope to obtain 
a good approximation to the likelihood contours by evaluating merely the sec- 
ond derivatives of In L at the maximum and assuming In L to be quadratic. 

A possibility that suggests itself for simplifying the computation of (21) is to 
use a polynomial approximation to the logarithms that appear in In L, of the 
form 


In (1 + 2) = aye + agr? + - + - + a2”, (23) 
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valid for x ranging from 0 up to the highest value for \7' thought to be of inter- 
est. (For the derivation of such polynomiels see ref. [8].) Equation (21) now 
becomes 


In L = min (Av) — (n — m)v In (1 + AT) 
— (v + 1) far(Zt) + aad*(Sd2) + - - > + a,d7(EA)}. (24) 


With this approximation we have sufficient statistics, namely (in addition to 
n and 7’) m and the first p moments of the observed times (¢;). The required 
power p of the approximating polynomial depends on the upper bound of the 
values of \7' to be considered, and can be determined by trial. Presumably 4 is 
the least value that could ever be satisfactory. No very great accuracy in ap- 
proximating In L is called for. If a modest value for p is indeed good enough, 
then the maximum of (24) and contours round it can be computed by direct 
search, varying v for each chosen value of \; derivatives need not be evaluated 
—they may be misleading. With this method, there is no point in reparametriz- 
ing L by (22). 


7. REFERENCES AND ACKNOWLEDGMENTS 


Gulland [6] has suggested the response law (4) in studying commercial fish 
populations. He observes that there are two distinct types of mortality, that 
due to capture by man and that due to other causes, and suggests that they can 
be regarded as acting independently and exponentially, with rate coefficients 
F and M, say. If n marked fish are released in the fishing ground, which is then 
fished continuously for a period 7’, and the dates of recapture of marked fish are 
recorded by the fishermen, we have an estimation problem identical with that 
considered above in Section 2, with the following relation between the param- 
eters: 


t= F/(F + M), a=F+M. 


The purpose of the study is to estimate F and M separately. Gulland gives the 
likelihood function (6) and an approximation to the maximum-likelihood esti- 
mates. I am indebted to Dr. Gerald J. Paulik for this reference; he has made 
further studies of the same sort in connection with the Pacific salmon fisheries. 
A broad area of relevant literature has been reviewed by Chapman [2]. 

The same problem would occur in life testing, if an unknown proportion é of 
the population were subject to an exponential failure law with unknown rate a, 
whereas the rest of the population was not subject to failure. A related problem 
has been studied by Mendenhall and Hader [11], of a population composed of 
two subpopulations following different exponential failure laws, where it is pos- 
sible to identify which of the two subpopulations each observed failure comes 
from, but nonfailing individuals are not so identified. 

Maguire, Pearson and Wynn [10] have suggested (20) as a distribution for 
time intervals between industrial accidents. They do not consider the estima- 
tion problem, which in any case would be different from that considered here, 
owing to the different nature of their data. 

The graphical presentation of likelihood functions is not common in the 
literature at present, although (as I understand) it is a regular practice with 
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some persons, notably with Professor G. A. Barnard. One-dimensional likeli- 
hood functions have been published by Haldane and Smith [7] and by several 
later writers in the same journal, and also by Fisher [4]. A two-dimensional 
likelihood function was tabulated by Brown [1]. 

The device of polynomial approximation used to derive (24) has some sim- 
ilarity to, though it is not identical with, a technique developed by Shenton 
[12]. He expresses the first derivative of In L as a series of orthogonal poly- 
nomials with respect to the chance distribution of the observations, and so de- 
duces an expression for the efficiency of moment estimates. 

In preparing this paper I have been greatly helped by conversations with 
many persons, especially with Professor O. Kempthorne and with my colleagues 
during my year (1959-60) in the Statistics Department of the University of 
Chicago. 
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A CLASS OF DISTRIBUTIONS APPLICABLE TO ACCIDENTS 
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This paper presents an extension of the mathematical model used to 
justify accident proneness. It assumes that the distribution of accidents 
incurred by an individual in non-overlapping intervals is a correlated 
bivariate Poisson (C.B.P.). On compounding this correlated bivariate 
Poisson through a Gamma distribution an extended bivariate negative 
binomial or, more precisely, a compound correlated bivariate Poisson 
(C.C.B.P.) distribution is obtained. Recurrence relations and expres- 
sions for the required probabilities are illustrated for two sets of data. 
The C.C.B.P. proved to fit as well as the bivariate negative binomial 
when the estimate of Biz was close to zero, and much better than the 
latter distribution when the estimate of Bi. was not close to zero. 


1. INTRODUCTION 


n 1919 Greenwood and Woods [11] formulated three models—pure chance, 

proneness, and true contagion—to explain the observed frequencies of acci- 
dents. Their work and that of Greenwood and Yule [12], who presented the 
mathematical theory of the above article, have formed the basis for many of 
the subsequent statistical investigations concerned with the distribution of 
accidents. Arbous and Kerrich [2], by use of generating functions, give a 
thorough review of these results. 

As is well known, all three models assume that the individuals exposed to 
accident are working under conditions of equal environmental risk. If accidents 
are assumed to occur by pure chance in a homogeneous population this model 
leads to a Poisson distribution. 

The proneness or apparent contagion model assumes that workers are not 
equally liable to have accidents. Rather, they can be divided into subgroups, 
each of which has its own susceptibility to accidents. The overall distribution 
of accidents is then a compound formed by a number of sub-distributions each 
of which is a Poisson, with the Poisson parameters distributed in the manner 
of a Pearson Type III curve. The resulting distribution is a negative binomial. 

The third model, a biased or true contagion model, assumes that the indi- 
viduals all start work with equal probability of incurring an accident. However, 
if one incurs an accident by chance, his liability of sustaining further accidents 
is increased or decreased in a certain way. This model results in a negative 
binomial distribution, which, in the univariate case has the same parametric 
form as that which results from the model of proneness [14], [13], [9]. Where 
two variates are involved, however, such as the number of accidents sustained 
by the same individual in two non-overlapping time intervals, the bivariate 
negative binomials have only slightly different parametric forms [2], [3], [10]. 
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To test the fit of the two bivariate negative binomial distributions Arbous 
and Kerrich [2] estimated the parameters by the method of moments and for 
their data the hypothesis of proneness was more plausible than the hypothesis 
of true contagion. Yet, as Sichel [16] states, estimation of the parameters of 
the negative binomial distribution by moments is often inefficient and may 
cause possibly true hypotheses to be rejected. 

The inconclusiveness of such models is evident. A good fit of a distribution 
does not necessarily justify the assumptions of the underlying model [3], [4], 
[15]. When the negative binomial gives an appreciably better fit than the 
Poisson distribution, as it usually does, it is impossible in the univariate situa- 
tion to decide which of the models, true or apparent contagion, is the more 
plausible. In the bivariate situation an extremely sensitive test is required to 
distinguish between the two models. 

In many instances the distributions obtained through the models above do 
not fit accident data well. Bates and Neyman [3] give several such examples. 
Consequently it seems reasonable to extend these models in order to allow 
greater flexibility in the form of the resulting distributions. The purpose of this 
study is to develop a possible extension of the mathematical model used to 
justify accident proneness. It is also possible that the resulting distributions 
might be obtained on the basis of an extended model of contagion, however such 
a development is not included here. 


2. MODEL BASED ON PRONENESS 


The family of bivariate negative binomial distributions which arises from 
the proneness model set forth by Arbous and Kerrich [2] is a special case of the 
family of bivariate distributions which arises from the extended proneness 
model given in this paper. To appreciate the distinction between these two 
families various formulae which concern the bivariate negative binomial dis- 
tribution will be summarized here. 

Use will be made below of the technique of probability generating functions 
in order to simplify the presentation. This technique is similar to that of mo- 
ment generating functions. In fact, if gy(z) and my(t) are the probability gen- 
erating function (p.g.f.) and moment generating function (m.g.f.) respectively 
of Y, then 


gy(z) = po + prz + poz? + pe? +::: 
te ts 
my(t) = 1 + mi’ + m2’ at us’ mec 2 


where 
prp=P{Y=r} and u/ = EY*. 


It follows that my(t) =gy(e‘). The extension of these concepts to several dimen- 
sions is quite straightforward. 

Let X, be the number of accidents sustained by an individual in the time 
interval J,;=(T'n, Ti) of length 4, let X_ be the number of accidents sustained 
by the same individual in the time interval J;=(T, Tx) of length 6, and let 
X = X,+X_ be the number of accidents sustained by this individual in the com- 
bined time interval J=1,+], of length 6=6,+4:. 
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Assume that the population is a mixture of homogeneous populations so 
that the conditional probability for a given \ of an individual sustaining r acci- 
dents in J, and s accidents in I, is the product of two independent Poissons with 
parameters 6,\ and 6A, respectively. Upon further assumption that \ follows a 
Pearson Type III distribution, namely 


BENE 1g-BA 
SE ninco A>0,8>0,k>0 
P'(k) (1) 


= 0 A<0 


p(r) 


the joint distribution of X,, X, is a bivariate negative binomial with probability 
generating function (p.g.f.) 


9x,,X,(21, 22) = E(z:*'2.*2) = [A’ + Byat+ B,’z2|-* (2) 


where 


By 


= (1 — B,’ — B,;') > 0. 


The marginal distributions of X, and X, and the distribution of the sum X 
are univariate negative binomials with p.g.f.’s of the form (q—pz)-*, where 
q>0, p>0, gq=1+>p. The regression of X, on X;, is linear and conversely. 

When 4, and 6 are known, estimates of the two independent parameters of 
the bivariate negative binomial distribution, B,’ and k, may be obtained from 
two moments of the distribution of the sum X, namely w,’=E(X) and 
uo = E(X —y,")?, by means of the relationships 

By = _ ft a , 
rpit ) ' 
— |e 
ro 

(u1’)? 


Me — My’ 


bn 


3. EXTENDED PRONENESS MODEL 


The C.C.B.P. (compound correlated bivariate Poisson) distribution arises 
from compounding a C.B.P. (correlated bivariate Poisson) with a Gamma dis- 
tribution, Equation (1). Under the slightly more general proneness model this 
may be interpreted as implying 

(a) that the population considered may be divided into subgroups each hav- 

ing a different but unchanging liability to accident, 

(b) that the number of accidents incurred by an individual in any one such 

group over two time periods follows a C.B.P. distribution. 
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Definition of homogeneous population. The assumption of a C.B.P. distribu- 
tion extends the notion of a homogeneous group. It is generally agreed that one 
can never prove that a population is uniform with respect to personal and en- 
vironmental attributes which affect the occurrence of accidents. However, ac- 
cording to Arbous and Kerrich [2], if the distribution of accidents in each of 
two non-overlapping intervals of time appears to be a Poisson, and if there is 
no evidence of correlation between accidents in these two time periods, the 
population is assumed to be nearly homogeneous. The more general model dis- 
cussed here leaves unchanged the idea that a homogeneous population consists 
of individuals all equally liable to accidents. However, it replaces zero correla- 
tion between occurrence of accidents of an individual in one time period and 
occurrence of accidents of the same individual in another by constant (non- 
negative) correlation. Thus, the C.B.P. distribution asserts that for every indi- 
vidual with the same liability to accident there exists, in addition to the con- 
stant mean number of accidents, a constant correlation between the number of 
accidents in one time period and the number in another. 

Correlated bivariate Poisson distribution. The distinction between the assump- 
tions of these two proneness models is evident from the form of the p.g.f. for 
the C.B.P. distribution. Let X,, Xs, and X be defined as before. Then the C.B.P. 
distribution has p.g.f. 


Qx,.xyin(21, 22) = exp{A[Ai(21 — 1) + 62(@2 — 1) + Ara(e1 — 1)(@2 — 1))} 
A> 0, (4) 
0 < 42 < min (4;, 42) 


where \@,; and dé are the means of the marginal Poisson distributions while \62 
is the covariance parameter. When 6,.=0 this p.g.f. reduces to that for the 
product of two independent Poissons with means \é, and N44, the joint distribu- 
tion employed previously in the definition of a homogeneous population. 

Derivation of the C.P.B. distribution may be found in Campbell [5], Aitken 
[1], or Feller [8]; and Teicher [17] gives useful recurrence relations. Maritz 
[15] has described accident data with it. 

Compound correlated bivariate Poisson distribution. Upon the assumption that 
\ has a Gamma distribution, p(A), given in Equation (1), the unconditional dis- 


tribution of accidents in two non-overlapping time intervals, which is a 
C.C.B.P., has p.g.f. 


9x,,.X.(21, 22) -f 9X;,X4)A(21, Z2)p(A)dA 
0 


= [A + By, + Bozz + By22:22|* 


1 
_ 3 (0; — 02) < 0, 


1 
— — (02 — O12) < 0, 
8 
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G12 
By a = ¢ 
B 


A; 62 P12 
A=1+—+—-—e=(1—B, — B.— By) > 0. 
8 8 8 
The similarity of this p.g.f. to that of the bivariate negative binomial in 
Equation (2) is obvious. When 6.=0, then Bj,.=0 and the two p.g.f.’s are 
identical. Thus the bivariate negative binomial is a special case of the C.C.B.P. 
distribution. When 6.0, then B, and Bs measure the difference between 6, 
62 and between 62, 62, respectively, on an appropriate scale dependent upon 8, 
while k as before is an index parameter. 
As is true of the marginal distributions for the bivariate negative binomial, 
the marginal distributions of X,; and X, for the C.C.B.P. distribution are uni- 
variate negative binomials. That is, from Equation (5), X, has p.g.f. 


gx,(z1) = gx,,x.(z1, 1) = [(A + Bs) + (Bi + Bis)ai}~*. (6) 


The mean and variance of the marginal distribution of X, can be found through 
differentiation of the moment generating function (m.g.f.) 


mx,(t) = gx,(e") = [(A + Bz) + (Bi + Bie} (7) 
which gives 
mio = E(X1) = — k(Bi + By), 
woo = E(X, — ww’)? = — k(Bi + Bu)(l — Bi — Bi). 
Similar results are obtainable for the marginal distribution of X». 


The distribution of the sum, X, for the C.C.B.P., however, is not a univariate 
negative binomial, as before, unless By,=0. It has p.g.f. 


gx(2) = gx,.x,(2, 2) = [A +- (Bi + Bz)z + Bisz*}* (9) 
with mean and variance 
ui’ = E(X) = — k(Bi + B2 + 2By), 
yo’ = E(X — my’)? = k[(B, + Be + 2By2)? — (Bi + Be + 4By)]. 


The covariance of X,; and Xe, which can be found from the m.g.f. of the joint 
distribution, is 


| ine E(X, ar H10) (Xe Sa on’) _ k{By(B, + B, + Bye -“ 1) + BB]. (11) 


As before the regression of X, on X, is linear. This can be seen from considera- 
tion of the first moment about the origin of the conditional distribution of X 
accidents which occur in the second time interval given X,=r accidents oc- 
curred in the first time interval. This distribution, which is that of the sum of 
two independent random variables, one with a positive binomial distribution 
and one with a negative binomial distribution, has p.g.f. 


B,+ = |(- + Bez + 
B, + By A+B, 


(8) 


(10) 





(12) 


9X »|X =r(2Z2) ™ [ 
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Both its mean and variance, 
ais X. ory = sy | Statin |r 
A+B, B,+By A+B, 
kAB, BiB, AB, 
ap (A + B,)? le + By)? ag (A+ a ; 





(13) 








u2(X: | Xi=r)= 


are linear in r. 

Probabilities and recurrence relations for C.C.B.P. distribution. For the 
C.C.B.P. distribution the probability that r accidents occur in the first time 
interval and s in the second can be expressed as 


s (-1)*T(k+r+i) [BB 
. = A-*-'B,*B,.* 
Px,,x,{7, 8} = A*"By"B ede DET® iw ete 


r,s,i=0,1,2,---. 





Re e 


’ 


This follows from the p.g.f. in Equation (5) since by definition 


9x,,x2(21, :»\ (15) 


21,290 


orte 
ris!Px,.x,{r, 3} = \ 
022°02z)" 
Explicit representation of the right-hand side of Equation (15) is obtained by 
application of Leibnitz’s formula for the derivative of a product. 
When s>r, Px,,x,{r, 8} may be obtained from Equation (14) by reversing the 
positions of r and s, B, and Bz. 
Several recurrence relations can be obtained for the joint probabilities of the 
C.C.B.P. distribution. Three of the more useful are 
e rik+r+i) 


Pr { = (—A -rB r-8B * 
xi.xath, 8} ( "Bi - X (r —s +i) 's — 2) IT(k + 2) 





16 
cece | (16) 
. 5 | Prxsl0, i, 


s 1 
(k+r+ 2) > 


Px l, <d Px,,x, ’ ’ 
meth oe | tinea 








t 
Px,,.x,{r + t, r} = | Px,,x,{r, r+ t} 


2 


a r,s,t,4 =0,1,2,---. 


Here Px,,x,‘®{r, 8} in Equation (17) refers to the ith term in the above ex- 


pression for Px,,x,{r, s}, Equation (16). 
With respect to the distribution of the sum X, a general expression for the 
explicit probabilities is complicated. It is more convenient to use the simple re- 


currence formula 


Pulel at ei "| Pstr - 1} 


A 
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— By 2+r—2 
+| iil > |Pxtr - 2}. (19) 


Estimation of parameters for the C.C.B.P. distribution. If it is assumed that 





where 6, and 4, are known, we have only three independent parameters to esti- 
mate, Bz, By and k, since 
6 
(> as 1) Da. 
52 


(If this assumption about @, and #2 is not permissible, one would, of course, 
have four parameters to estimate.) One combination of moments that might be 
used to provide these estimates is io’, ui, aNd woo. Estimates of the parameters 
could then be obtained from the relations 

Mio. 


, 
ay 


& 


where 


, b2 
Moz — #10 I. | 
= 5? 
Ki0 =| 


upon substitution of sample moments for the corresponding population values. 
Other combinations of moments are, of course, possible. More efficient 
methods of estimating the parameters will be presented in a later paper. 





? 


4. PRACTICAL APPLICATIONS 


The data in Table 1 were obtained from Farmer and Chambers [7] and con- 
sist of the numbers of accidents sustained by 166 London omnibus drivers in 
two consecutive years. When these data are fit by a C.C.B.P. distribution as 
well as by a bivariate negative binomial, the frequencies which result are those 
of Tables 2 and 3, respectively. 

As seen from Table 4, the C.C.B.P. appears to fit the data somewhat better 
than the bivariate negative binomial. However, both fit the data well. The two 
different grouping procedures used concur in these results. The first procedure, 
Grouping (a), recommended by Cochran [6] for unimodal distributions, re- 
quires a minimum expectation of at least 1. The second procedure, Grouping 
(b)—a minimum expectation of 2 or more—is that suggested by Cochran [6] 
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for contingency tables with very small expectations and less than 30 degrees of 
freedom. For the purposes of comparison both methods are used in computing 
chi-square values for all four distributions. 

The regression lines of Figure 1 illustrate schematically how similar the fits 
of these two distributions are to the actual data. This is to be expected since 
the estimated value of By is very close to zero. 

A hypothetical example is now presented to illustrate the situation where a 
large negative value of By. causes the C.C.B.P. distribution to approximate the 
data much better than the bivariate negative binomial. It is, of course, obvious 
that there are infinitely many sample distributions which are fit more closely 
by the C.C.B.P. than by the negative binomial, particularly when Bj, has large 
negative values. The purpose of this example is merely to illustrate the effect 
of a particular value of By, along with other values of the remaining parameters. 
Assume that the parameters of a C.C.B.P. distribution are k=0.75, By = — 4.00, 
B, = B,= —3.33, so A=11.67. A random sample of size 300 drawn from this 
population is shown in Table 5. It is a long-tailed distribution. 

To estimate the parameters, sample moments are calculated on the basis of 


TABLE 1. OBSERVED NUMBERS OF MOTOR VEHICLE ACCIDENTS 
AMONG 166 LONDON OMNIBUS DRIVERS DURING 
TWO TIME INTERVALS! 


r=number of accidents, year 2 
s=num- 
ber of 
accidents, 
year 3 


0 


















































Pe ii " | 
Total | 40 | % | 13 166 





1 The heavy lines indicate Grouping (a) of the data for the chi-square test of fit to the joint distribution (Xi, X: 
to yield a minimum expectation ot 1, The dotted lines indieate Grouping (b) of the data to yield a minimum expec- 
tation of 2. 
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TABLE 2. EXPECTED FREQUENCIES OF ACCIDENTS CALCULATED FOR 
DATA OF TABLE 1 WHEN C.C.B.P. DISTRIBUTION IS ASSUMED AND 
WHEN PARAMETERS ARE ESTIMATED BY RELATIONS IN 


Gm: i =3.9691, B,.= — 0.0242, B, = =B, = —0.3644 

















s =number | r=number of accidents, year 2 
of accidents, —— Total 
years | 0 1 2 3 + 5 6 7+ 
Sa ee ; ] 3 | 
0 15.09 | 13.58 | 7.23 3.97 | 1.03 |! 0.32 0.09 | 0.03 | 40.34 
1 13.58 | 16.58 | 11.15 | 5.52 52 | 2.26 7a 0.81 1 | 0.18 18 | 0.08 | 50.16 





| | 























23 |11.15 | 9.15 | 5.38 | 2.55 | 1.08 0. 2 a1 | 0.13 36.84 

3 | 2.97 | 5.52 | 5.38 | 3.68 | 1.99 | 0.92 | 0.87 | 0.14 | 20.97 
eae eetnensiate . 2S + See a 

4 | 1.03 | 2.26 | 2.55 | 1.99 | 1.22 | 0.62 | 0.28 | @.12 | 10.07 

ae head -s1 | 1.04 | 0.92 | 0.62 | 0.36 | 0.15 | 0.08 | 4.30 











0.28 





0.15 1.43 














0.08 


| 
0.03 0.02 0.01 0.22 


0.66 




















Co 
=) 
_ 
—) 
i) 
or 
Oo 
4 


9+ | 0.00 | 9. 0.01 


Total 40.35 | 50.19 


0.01 | 0.02 








| 
01 | 0.02 | 0.02 | 0.01 e}. 0.01 | 0.93 | 1.01 


0 
| 36.89 89 | 21.04 | 10.1 13 fs 4.34 | 1. 1.46 | 1.60 [166.00 





























the 251 observations recorded in the cells (7, 7), 7, 7=0, 1, 2, , 14, of Table 
5. The 49 observations corresponding to 15+ were spread over such a long tail 
that it was not feasible to indicate the individual values in the random sample. 
The estimates based on the truncated portion of the distribution were quite 
adequate for the purpose indicated below since we are merely comparing the 
chi-square values as an ad hoc measure of closeness of fit. Estimates of the 
parameters for the ©.C.B.P. distribution so obtained are k=1.2129, 
By = — 1.3346, B, = B,= — 1.4820, so A =5.2986. Estimates of the parameters 
for the bivariate negative binomial distribution are k’ = 1.1149, B,’ = —3.0591, 
so A’=7.1183. 

As Table 6 shows, when the above estimates of the parameters are used to 
fit the observed distribution for the sum, X, obtained from Table 5, the chi- 
square value for the C.C.B.P. is very much smaller than that for the bivariate 
negative binomial. When the known parameter values of the C.C.B.P. are used 
to fit this distribution, the chi-square value 14.02 differs only slightly from that 
obtained using the sample moments. To check whether the magnitude of the 
chi-square value that results when the bivariate negative binomial is fit to this 
same distribution of the sum could be reduced, various parameter values were 
selected. None of these were able to give a better fit than that produced by 
moment estimates. This example illustrates that the C.C.B.P. distribution may 





© =observed mean X,| X;=S 
A =theoretical mean, C.C.B.P. X,| X:=S8 
(_]}=theoretical mean, bivariate negative binomial Xx, | X:=S 


T 


6 








X:=S 


Fic. 1. Comparison of observed regression of X, on X:=S to theoretical regression 
of C.C.B.P. and bivariate negative binomial for data of Table 1. 


TABLE 3: EXPECTED FREQUENCIES OF ACCIDENTS CALCULATED FOR 
DATA OF TABLE 1 WHEN A BIVARIATE NEGATIVE BINOMIAL 
DISTRIBUTION IS ASSUMED AND WHEN PARAMETERS 
ARE ESTIMATED BY RELATIONS IN (3): 


f’ =2.7525, B,' =B,' = —0.5406 








Number of Number of accidents, year 2 
accidents _ 





we Br sa 





1.18 | 0.42 | 0.14 0.05 


2.07 | 0.84 | 0.32 | 0.12 | 48. 76 





— ate 


| 
2.09 | 0.95 | 0.43 | 0.16 | 32.07 07 
| | 








58 80 37 | 0.16 | 17.75 





.00 , . : 8.95 





.59 ‘ é ; 4.26 





0.14 | 0.32 











| 0.00 | 0.01 


























Total | 50.53 48.81 
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TABLE 4. VALUES OF CHI-SQUARE OBTAINED FROM COMPARISON OF 
THE OBSERVED, JOINT, SUM AND MARGINAL DISTRIBUTIONS OF 
TABLE 1 TO THE THEORETICAL DISTRIBUTIONS 
OF TABLES 2 AND 3 











C.C.B.P. Bivariate negative binomial 


Distribution Table 2 Table 3 










Grouping (a)! | Grouping (b)? | Grouping (a)! | Grouping (b)? 












Xi, X2 (joint) 

























probability 0.76 0.68 





x? 28 .63 26.75 37.94 33.45 
df. 31 24 32 25 
probability 0.59 0.33 0.22 0.13 
X (sum) 
x? 4.96 4.80 10.91 | 9.46 
d.f. | 8 7 10 8 
0.45 | 0.32 
| 





X, (marginal, year 2) 





x2 7.31 5.74 8.84 7.30 
d.f. 5 4 6 4 
probability 0.21 0.23 0.20 0.13 














| 
X:2 (marginal, year 3) 
x* 2.94 2.82 11.42 9.10 











d.f. 6 4 6 4 
probability 0.81 0.59 08 0.06 





1 Cells are grouped to obtain a minimum expectation of at east 1. 
? Cells are grouped to obtain a minimum expectation of at least 2. 
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TABLE 5. RANDOM SAMPLE OF SIZE 300 DRAWN ON BASIS OF 
HYPOTHETICAL C.C.B.P. DISTRIBUTION WITH PARAMETERS 
k=0.75, Be = —4.00, B, = B: = —3.33, SO A = 11.67 








s =number of r=number of accidents, interval 1 
accidents, 
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TABLE 6. COMPARISON OF FIT OF C.C.B.P. AND BIVARIATE NEGATIVE 
BINOMIAL TO OBSERVED DISTRIBUTION OF THE SUM OBTAINED 
FROM TABLE 5 WHEN PARAMETERS ARE ESTIMATED 
BY THE METHOD OF MOMENTS! 






































poe rebel Expected frequency Chi-square 
fuse C.C.B.P. biv. N.B. C.C.B.P. biv. N.B. 
0 40 34.00 53.89 0.90 3.58 
1 30 23.08 45.38 2.07 5.16 
2 29 | oF. | ae 0.76 1.39 
3 16 | 21.41 28 .24 1.37 5.31 
4 18 19.51 21.90 0.12 0.69 
“aes Ss he 16.88 3.92 3.68 
6 11 | 15.24 12.95 1.18 0.29 
7 11 ae 9.92 0.43 0.12 
8 11 pe ft. 7.58 0.04 1.54 
9 | 9 | 10.25 5.78 0.15 1.79 
10 10 8.94 4.42 0.13 7.04 
11 | 8 7.79 3.37 0.06 6.36 
12 6 6.73 2.57 0.08 4.58 
13 3 5.86 2.11 1.40 0.38 
14 4 5.09 1.59 0.23 3.65 
156+ | 85 75.10 | 47.44 1.31 29.78 
Total df. | 
probability | 300 300 .00 | 300.00 14.15 75.34 
12 13 
| 0.30 < .005 




















1 Estimates of the parameters for the C.C.B.P. are t= 1.2129, Bu=- 1.3346, Bi =B: = - 1.4820, so 4 =5.2986; 
estimates of the parameters for the bivariate negative binomial are hk’ =1.1149, Bi’ =f,’ = —3.0591, so A’ =7.1183. 
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© =observed mean X,| X.=S 
A =theoretical mean, C.C.B.P. X,| X.=S 
{_]=theoretical mean, bivariate negative binomial X,| X:=8 








X:=S 


Fic. 2. Comparison of observed regression of X,; on X:=S to theoretical regression 
of C.C.B.P. and bivariate negative binomial for hypothetical data of Table 5. 


explain data when the bivariate negative binomial does not. 

The regression lines in Figure 2 indicate fairly closed agreement between the 
two distributions under consideration on the basis of the observed data. It 
appears that regression, in contrast to chi-square, is of little discriminatory 
value. 
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EX ANTE AND EX POST DATA IN INVENTORY INVESTMENT* 


Murray Brown 
University of Pennsylvania 


Expectational data on manufacturers’ inventory investment and sales 
are analysed. These data are provided by Fortune Magazine’s survey of 
corporations. It is found that firms tend to underanticipate investment 
changes and that entrepreneurs tend to be conservative in anticipa- 
tions when economic conditions are depressed and less conservative 
when conditions are favorable. A theory of investment behavior is then 
derived and two realization functions are tested, one assuming perfect 
competition and cost minimization and the second assuming profit 
maximization and an oligopolistic market. The second performs best. 
Certain ex ante and ex post estimating forms are tested; the ex post 
form yielding excellent statistical results with real inventory stock as 
dependent variable involves the explanatory variables of real unfilled 
orders, real sales and a distributed lag variable. A major conclusion is 
that the Fortune ex ante data appear to be only marginally useful in 
explaining inventory investment; and until observation errors are sub- 
stantially reduced (inter alia), the marginal work units involved in using 
this data may not be warranted. A positive conclusion is that the real 
unfilled order variable appears to be highly useful. 


INTRODUCTION 


HE most volatile component of GNP is inventory investment. Perhaps, for 
‘Dat reason successful inventory predictions using endogenous economic 
variables such as ex post sales have been difficult to achieve. Thus the in- 
sistent need for reliable predictions of the inventory variable to establish 
short run policy has induced organizations such as Dun and Bradstreet and 
Fortune Magazine to survey corporations in order to use ex ante data in fore- 
casting inventory investment.' In addition, these surveys may have been 
prompted by the annoying suspicion that the inventory decision may embody 
expectations which cannot be represented by ex post data; therefore, predic- 
tions based on ex post variables may, sui generis, be abortive.? In this paper we 
shall focus on the Fortune data which are obtained by a quarterly survey of 
approximately two hundred corporations and which elicit inventory and sales 
anticipations. The Fortune data provide a continuous panel which is unique 
in that it is the only ex ante data on the magnitude, as well as the direction of 
change, of inventory investment. 

The purposes of the present paper are as follows: 1) we shall use the Fortune 
data to establish certain patterns with respect to inventory and pricing ex- 
pectations, to increase our knowledge of the expectational process; 2) we shail 








* The author is indebted to Irwin Friend for his comments and Mr. Todd May, of the Business Roundup 
staff of Fortune Magazine for his kind aid in providing the expectations data. The results in the appendix were com- 
puted by the University of Pennsylvania Computer Center. 

1 For a description and analysis of the Dun and Bradstreet surveys, see M. Hastay, “The Dun and Bradstreet 
Surveys of Businessmen’s Expectations,” Proceedings of the Business and Economics Section of the American Sta- 
tistical Association, September 1954, pp. 93-123. 

2 Cf. Irwin Friend, “Critical Evaluation of Surveys of Expectations, Plans, and Investment Behavior,” in 
Expectations, Uncertainty, and Business Behavior, edited by Mary Jean Bowman. Social Science Research Council, 
Council, New York, 1958, p. 190. 
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inquire into some of the reasons for discrepancies between actual and antici- 
pated inventory stock; this will provide knowledge by which we may utilize 
ex ante data more effectively; 3) certain ex post forecasting forms based on 
sales, unfilled orders, new orders and the method of distributed lags will be 
developed; these are combined with forecasting forms utilizing the Foriune 
expectational data and examined in terms of their statistical reliability. 

It may be useful to anticipate certain broad findings of the paper. In Sec- 
tion 1 the following expectational patterns are presented: firms systematically 
tend to underanticipate investment changes, suggesting that the holding of 
buffer stocks, among other things, permits a conservative attitude toward in- 
ventory investment (i.e., entrepreneurs feel they may hold to a slowly changing 
inventory policy in the face of larger changes in business activity); entre- 
preneurs tend to be conservative in anticipations when economic conditions 
are depressed, and less conservative when times are good; manufacturers’ 
forecasting errors, as indicated by the Fortune data, tended to be small over the 
period considered but the variance of the errors was large. 

The theoretical heart of the paper is presented in Section 2. In order to esti- 
mate indirectly some of the reasons for discrepancies between actual and anti- 
cipated inventory stock, we use the method of realization functions. Here a 
theory of inventory investment is derived from the profit maximization pro- 
cedure of the firm, which specifies the variables entering into realization func- 
tions. Two realization functions are tested, one assuming cost minimization 
and perfect competition, and a second assuming profit maximization and an 
oligopolistic market. The second performs best. A price hedging equation 
yielded inconclusive results. 

Readers interested in the use of ex ante and ex post variables in prediction 
equations should skip to Section 3. The ex post forms that yielded the best sta- 
tistical results in predicting real inventory stock involved the explanatory vari- 
ables of real unfilled orders, real sales and a distributed lag variable. The best, 
ex ante, predictory equation involved real sales anticipations and real, lagged, 
unfilled orders. Although the best ex ante form performed as well as the best ex 
post form, we conclude that ex ante data appear to be marginally useful in 
predicting inventory stock. 


The Data 


The expectational data on inventories and sales are provided by a survey of 
approximately 200 corporations by the Business Rouadup staff of Fortune 
Magazine. We focus here on the manufacturing sector in which twelve industry 
groupings are distinguished. In this paper we have utilized the summary figures 
for each quarter and each industry worked out by the Fortune staff.* For the 
manufacturing aggregate, we have weighted each industry’s inventory data by 
the mean of the ratio, H;/H, for the period under consideration, where #1; is 
industry 7’s inventory stock and H is total manufacturing inventory stock. 
Similarly, sales data for each industry i were weighted by the mean of S,/S. 

The period we are considering begins in the third quarter of 1956 and ends in 





* That is, the replies of the individual firms have been aggregated into industry groupings by the Fortune staff 
and we have taken as given the methods of aggregation. 
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the first quarter of 1959. Although, the Fortune surveys go back to 1954, the 
changes in the questionnaire and aggregation procedures made it advisable to 
begin in 1956 so that we would have a fairly homogeneous set of data. This is a 
sad compromise between the danger of a small sample and the danger of inac- 
curate measures. However, the period is interesting from a cycle viewpoint— 
i.e., it contains two turning points. 

The Fortune surveys provide data on the following variables, among others, 
for the manufacturing groups: 


hi,, industry 7’s two-quarter expectations of the percentage change in its 
inventory stock.‘ 

s{,, industry 7’s two-quarter expectations of the percentage change in its 
sales. 


In addition, supplementary information on other variables was culled from the 
Survey of Current Business: 


P;.., the wholesale price index of industry 7’s products 
h;.z, industry 7’s actual two-quarter inventory change in percentage terms 
8;,t, industry 7’s actual two-quarter sales change in percentage terms 
U,, manufacturers’ unfilled orders 
S;.1, industry 7’s sales 
N,, manufacturers’ new orders 
H;,, industry 2’s inventory stock 
Hj, industry 7’s expected inventory stock. This variable is constructed 
from the Fortune survey data on hj, and data from the Survey of Current Busi- 
ness on H; ;; i.e., 


Hin = (Hie) (hi) + His. (1) 


A similar procedure was used to obtain Sj,. 

The respondents to the Fortune questionnaire were asked to state their antici- 
pations in seasonally adjusted terms, and a graphical test indicated no clear-cut 
seasonal patterns in the anticipations data. The data culled from the Depart- 
ment of Commerce publications are seasonally adjusted, except for U, which 
we adjusted. 


1, PATTERNS IN INVENTORY EXPECTATION 


In Table I data on expected and actual percentage changes in inventory 
holdings are presented. Column 1 contains the aggregate two-quarter inventory 
anticipation in percentage terms for 12 manufacturing industries, each weighted 
by their share of inventory holdings in total manufacturing inventories. Column 
2 contains the corresponding actual changes. In the third column the weighted 
mean difference between predicted and actual inventory changes is presented. 
Finally various statistics appear which are discussed below. 

It appears that the absolute value of the weighted mean difference tends to 
decrease slightly through time, perhaps reflecting the improved coverage of the 
survey and the increased sophistication of the respondents. By comparing the 





* The pertinent question in the Fortune questionnaire that yielded the h;j,; data is: “Taking your inventory as 
of the end of the current quarter (estimated) and assuming no tax cut, what do you expect your inventories will be 
six months from now and twelve months from now (excluding change in prices).” 
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first two columns we can also observe that for ten of the eleven period changes 
entrepreneurs underanticipated actual inventory change. There are several ex- 
planations for this. As an analog to the fixed plant and equipment surveys,’ the 
reason for the systematic understatement may be due to the omission of many 
small purchases for inventory investment. Also, as H. Theil has shown, the 
underestimation of actual changes is more plausible than overestimation when 
certain statistical criteria for the analysis of the discrepancies are satisfied.® 
The systematic understatement of inventory anticipations may be ame- 
nable to the hypothesis that the holding of buffer stocks permit entrepreneursthe 
luxury of holding to a relatively stable inventory change policy in the face of 
marked changes in their anticipations of business activity. To put this another 
way, firms recognize they err in their sales anticipations and thus carry buffer 
stocks.’ 
TABLE I. ACTUAL AND ANTICIPATED PERCENTAGE CHANGES IN 


MANUFACTURERS’ INVENTORIES FOR THE PERIOD 
III, 1956-1, 1959 


! D hi. LD hin Lhe — L hie 


| —2.51 — .36 —2. 
} 














—2.09 —3. d 
—3.13 —6. 3. 
77 —65. 5. 
.53 —2. .. 
-93 —1. 
11 , —2. 
.99 , —1. 
1.25 16 —3. 
.73 ‘ —6. 
1956 III -78 b. —2. 





_ 5.78 .63 —10. 
Mean .6164 .3300 —- ., 
o 3. 














Dd hie = 1.0925 + 1.2371 5 hi, #= 5372 
(.2984) S = 3.7183 


It is interesting to note that when positive changes in inventory investment 
were expected the mean of the weighted mean differences was —2.90 and when 
negative changes were expected the mean of the weighted mean differences was 
—.21. In both cases, actual changes were underestimated but the degree of 
underestimation was larger in the former case. *® 

For the complete period, the mean of the anticipated inventory changes was 





5 See Irwin Friend and Jean Crockett, “Plant and Equipment Programs and Their Realization,” Short Term 
Economic Forecasting, Studies in Income and Wealth, vol. 17, National B of E ic Research, Princeton, 
1955, p. 55. 

* Economic Forecasts and Policy, Amsterdam, 1958, Chapter V. 

1? For an analysis of the buffer stocks motive, see Michael Lovell, Manufacturers’ Inventories, Sales Expectations, 
and the Acceleration Principle, Cowles Foundation Discussion Paper No. 86, December, 1959. 

8 F. Modigliani and O. Sauerlander, (“Economic Expectations and Plans of Firms in Relation to Shcrt Term 
Forecasting,” in Short Term Economic Forecasting, Vol. 17 of Studies in Income and Wealth, Princeton 1955, pp. 
288-9) find that actual sales are underestimated in periods of rising business activity and overestimated in periods 
of falling activity. Although their findings refer to sales expectations, the similarity of this pattern with our findings 
for inventory expectations is striking. 
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— .6164 and for the actual changes, .3300. The difference is: —.9464. The corre- 
lation # between the actual and anticipated changes is .5372. Finally, the stand- 
ard deviation of the weighted mean difference is 3.1281. Although these statis- 
tics indicate a wide dispersion in the errors of prediction made by manufactur- 
ers, the anticipations data appears to be marginally useful when it is deflated 
(see below). 

A problem arises in the initial stages of studies in anticipatory data as to 
whether entrepreneurs express their anticipations in real or money terms. That 
is, it is necessary to establish the units of account in which we are dealing. If 
the entrepreneurs state their anticipations in real terms, they would be re- 
quired to forecast the price level. If they state their anticipations in terms of the 
prices at the time the anticipations are made, they are not subject to a money 
illusion. Even though the question in the Fortune questionnaire on inventory 
anticipations asks respondents to exclude price changes, it is necessary to test 
each of these hypotheses. Thus consider the following regressions for manufac- 
turing: 


>(3*) = 1.1001 + .7440 (>) =. (3) 


(.3922) * 


i it i 


H, HY 
(>) = 8581 + .7898 x(, ) #2 = 3638. (4) 
i 7 t 


(3002) ~ 


The better form, both from the viewpoint of the unbiased coefficient of deter- 
mination and the standard error, is (4) which implies that on the whole entrepre- 
neurs conform to the request to hold static expectations when they state their 
inventory anticipation figures. Thus when we utilize expected inventory stock 
data, the relevant variable is H/ ,/P;,1-2. A similar statement holds for the sales 
anticipation variable. This is not meant to imply that entrepreneurs abstract 
from price expectations when they make their inventory anticipations for de- 
cision purposes (see below).° 

The functions that are to be tested in Sections 2 and 3 are tested on two sets 
of data. One set contains eleven observations, comprised of all manufacturing 
industries for eleven quarters and focuses exclusively on real stock variables. 
The degrees of freedom problem is severe but since researchers who wish to use 
the Fortune data may find only the aggregative data available, the tests using 
the aggregates will provide continuity with their work. A second set of data on 
which the functions are tested utilize percentage changes (not deflated) and con- 
tains either 132 observations (twelve industries for eleven quarters) or 44 ob- 
servations (four industries for eleven quarters). The tests on this continuous 
panel are presented in the appendix. These more reliable results can be used to 
provide supplementary evidence for the inferences derived from the tests of the 
aggregates. Yet, it should be mentioned that the two kinds of tests are not 





® Note that when anticipations data for plant and equipment are interpreted in constant dollars the error in 
anticipations is reduced. See Robert Levine, Plant and Equipment Expenditures Survey: Intentions and Fulfillment, 
Cowles Discussion Paper, 1956, p. 121. 
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strictly comparable since the tests using percentage changes utilize data which 
are not deflated. 


2. REALIZATION FUNCTIONS 


We may begin this section by inquiring into the reasons for discrepancies be- 
tween the actual inventory stock and Fortune’s data on anticipated inventory 
stock for the manufacturing sector. Although no survey data on reasons for the 
discrepancies are available, an indirect method, the testing of realization func- 
tions, is informative. A realization function relates actual inventory stock and 
anticipated inventory stock plus variables that reduce the residual between 
actual and anticipated stock. Realization functions are also useful in providing 
an understanding of the inventory investment process, which can then be used 
in forecasting (see next section). 

An initial problem is the selection of the explanatory variables in the realiza- 
tion function. Consider a firm which attempts to maximize a profits function 
(revenue minus costs) subject to a production constraint. Assume that the firm 
operates in a government free market and is free of subjective uncertainty. For- 
mally, the profit identity is 


wm = PX, — rs — GQ: (5) 
where r; is the price of inventory “inputs” (including storage costs), g; is the 


price of all other factor inputs and Q, is the services of all other factors. Given 
an output constraint 


XxX; _ f(A, Q:), (6) 


then (5) can be maximized subject to (6) in two steps: (1) a minimization of 
costs with respect to H and Q, provided output is given; thus by the method 
of Lagrange multipliers we obtain the equilibrium conditions, r—fy=0 and 
q—fa=0, where fi represents the partial of X with respect to H, etc.; (2) a sup- 
plementary maximization of profits with respect to output. The cost minimiza- 
tion yields the inventory demand relation, 


H = H(r/q, X) (7) 
while we obtain 
X = X(P/q, r/q) (8) 
by the maximization of profits. Combining (7) and (8) yields 
H = H[r/q, X(P/q, r/9)]. (9) 


Thus, with full maximization, the capital demand equation relates inventory 
stock to the ratios of product and factor prices. The full maximization pro- 
cedure of the firm provides two inventory demand equations—one resulting 
from cost minimization (7) and a second from the combination of cost minimi- 
zation and market forces (9). 

Holding relative factor prices constant, (7) becomes 


H = H(a, X) (10) 
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and (9) becomes 


H = H{a, X(P*, a)] (11) 


where a is a constant and P* is the prive level of the final product divided by a 
constant. Thus assuming factor prices constant the cost minimization form re- 
lates inventory stock to output and the profit maximization form relates inven- 
tory stock to the price level. We may rewrite (10) and (11) as 
H = H,(X) (12) 
H = H,(P)” (13) 
Now interpret H as the inventory stock that would prevail at the end of the 
period if the entrepreneur could adjust his production within the period to 
changes in demand in that period provided that we choose the period to be 
not less than a month." But at the beginning of the planning period, the entre- 
preneur must anticipate changes in demand and factor prices, and establish his 


production plans, and then proceed to vary his inventory stock in accordance 
with these plans. That is, his anticipated inventory stock function would be 


an H, = H,(an X,) (14) 
under cost minimization, and 
an H, = H,(an P,) (15) 


under profit maximization. The differences between his anticipated inventory 
demand and what he would do if he adjusts his production rapidly is 


H,— an H, = H,(X,) — Hi(an X,) = A(X, — an X)) (16) 


H, — an H, = H,(P,) — H.2(an P,) = H2(P; — an P,). (17) 

We may write (16) and (17) as 
H, = H\(X, — an X, an H,) (18) 
H, = HAP; — an P,, an H,). (19) 
These two realization functions assume constant factor prices and perfect 
competition. Since we shall employ linear least square regressions to test them, 
it is assumed in addition that the production function underlying (18) and (19) 


is linear and homogeneous. We use anticipated sales as a proxy variable for 
anticipated output. Since data for an P,; is unavailable the anticipated price 





1° For a fuller discussion of this type of investment theory, see Murray Brown, “Profit, Output and Liquidity 
in the Theory of Fixed Investment,” International Economic Review, January 1961, pp. 110-21. 

't The assumption of a rapid production adjustment is supported by Modigliani and Sauerlander, “Economic 
Expectations and Plans of Firms in Relation to Short-Term Forecasting,” Short Term E. ic For ting, op. cit. 
p. 350. They find that quarterly inventory holdings are positively related to the expected quarterly sales change 
but less closely related to actual change in sales; “ . . . firms manage to revise production sufficiently to take care of 
unanticipated changes in sales, but not sufficiently to change end inventories significantly to the level originally 
planned.” In a later article, Modigliani suggests that the speed of adjustment is a function of the diff between 
desired and actual inventories; See Franco Modigliani, “Business Reasons for Holding Inventories and their Macro- 
Economic Implications,” Problems in Capital Formation, Studies in Income and Wealth, Vol. 19, National Bureau 
of Economic Research, Princeton, 1957, pp. 499-506. 
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change is approximated either by (P;— P:-1) or (P:— P:-2), i.e., we assume static 
price expectations. 

However, by assuming an P,;=P,_, or an P;=P,_», we are not really testing 
(19). For, P:—Piu, e.g., may be interpreted as a price hedging term if manu- 
facturers do not hold static expectations and/or if they correctly anticipate 
prices so that an P,;=P,. The closest we can approach a test of (19) is by a 
price hedging term. Since this requires the crucial assumption of static price 
expectations, it is not certain that the price hedging form requires full profit 
maximization. For notational convenience, let H*= de (H;../P;.2) and 
H= >: (Hi1/Pi,t-2). 

The results of the tests are: 


Si, Si, 
H* = — 1.2989 + 1.2860H + 0106 ( - -> . “) 
t t-2 


(.4131) (.0063) 


2.9637 + .1914H + .3315(P; — Py.) 
(.1552) (.0481) 


2.9307 + .1992H + .3392(P, — P,-2) (19”) 
(.1604)  (.0440) 


R? = .8946 
S= .0829 
6?/S? = 1.6535" 


The 6?/S? statistic tests whether the assumption of nonautocorrelated residuals 
is warranted. For our study, values in the neighborhood of 2.2 satisfy the as- 
sumption of nonautocorrelated residuals."* Comparison of the results leads us 
to believe that the price hedging forms perform better, since the sampling errors 
are smaller, R? is greater and there is less serial correlation in the second or 
third equations. Even though (19’’) displays less serial correlation than (19’), 
the standard errors of the two equations and the coeffivients of the two equa- 
tions are not significantly different; therefore there is no basis for choosing be- 
tween them. Let us again mention the caveat that we are not certain that the 
full maximization forms are superior to the cost minimization form, since the 
assumption of static price expectations may be unwarranted. 





2 H*, TT, DS,4/P: and BS4/Pt_2 are measured in ten billions of dollars. All equations in the text are tested 
on eleven observations. 

’ B. I. Hart and John Von Neuman, “Tabulation of the Probabilities for the Ratio of the Mean Square Suc- 
cessive Difference to the Variance,” The Annals of the Mathematical Statistics, XIII (June 1942), 207-14. 
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We have used (P,—P,-4) and (P,—P,_2) as a proxy for (P;—an P,) in (19) 
and (19’’). Although this may be justified under the assumption of static expec- 
tations, if we admit of the possibility of imperfect competition, different be- 
havioral relations are required. Thus consider a firm in an oligopolistic market 
which adds to its backlogs rather than raises prices when demand pressures 
its capacity. We would expect this firm, then, to be oriented toward a backlog 
form of the realization function, i.e., it would focus on the unfilled orders fac- 
ing the firm as a means of equating ex ante and ex post inventory stock. 

The unfilled order variable applies only to an oligopolistic or imperfectly 
competitive firm and cannot be interpreted as a proxy for future demand for a 
firm in perfect competition. To show this, assume an increase in demand facing 
an industry; if each firm chooses to add to its order books and not raise price, 
the market price remains constant; this violates an assumption of perfect 
competition that no firm can influence price. 

An “oligopolistic” realization function might then take the form: 


H* = H;[(Hi/Pi.0), Uie/Pial, 


where U;,, is the level of unfilled orders at ¢t. The results of a linear least squares 
test are 


we U; 
H* = — 3040 + .7746H + .2528 >> (=) 


tt 


(20) 


1472) (0811 a 
a R? = .9027 


= .0082 
§2/S? = 1.4505. 


By inspection of the R? and 62/S? statistics we may say that there is a suggestion 
that an oligopolistie version of the realization function may be preferable to the 
cost minimization realization function, (18’). However, experiment with the 
first differences in U;,,/P;,, and a pressure on capacity variable, U;,:/S;,1 (see 
below), proved to be relatively unsuccessful. 

Realization functions which experiment on a continuous panel of manufac- 
turing firms appear in the Appendix. Using the same variables as in equations 
(18), (19) and (20) but transformed into percentage changes renders support 
to some of the conclusions reached on the basis of the aggregate tests using de- 
flated stock variables. The percentage change tests clearly indicate the superi- 
ority of the oligopoly realization function (20) over the cost minimization form 
(18). The tests in the appendix also show that the coefficient of the price hedg- 
ing terms does not significantly differ from zero at the .05 level. This is consist- 
ent with the findings of M. Lovell:““ . . . There is no support for the hypothe- 
sis that manufacturers speculate in stocks . . . .” Therefore, we can discount 
the importance of the price hedging equation. 

This discussion of realization function sheds some light on the reasons for the 
discrepancies between actual and anticipated inventory stock. In addition, it 





“4 Op. cit., p. 11. 
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provides a theoretical basis for the use of ex ante and ex post variables in fore- 
casting inventory stock, which will be discussed in the following section. 


3. EX ANTE AND EX POST DATA IN FORECASTING INVENTORY STOCK 


As noted above, the difficulty in predicting inventory stock and inventory 
investment by means of ex post variables has induced organizations to utilize 
ex ante data from surveys for this purpose. The general question arises as to 
the predictive ability of ex ante and ex post variables. In this section we shall 
take up the problem of forecasting inventory stock and present results of four 
types of predictory equations: ex ante-simple (the use of one ex ante explana- 
tory variable), ex ante-multiple (two explanatory variables, one ex ante and 
one ex post, which are suggested by the realization functions tested in the pre- 
vious section), ex post-simple (one ex post explanatory variable) and ex post- 
multiple (two ex post explanatory variables). 

Ex Ante-Simple Forecasting Forms 


H* = .8581 + .7898H (21) 
(.3092) 


0225 + 1.9025 > (Si,+/Pr-2) 
(.2871) 


= 2.0134 + 1.7034( 58%.) 1” 
(1.0954) 


M tH; = 


Comparing the coefficients of determination, the standard errors and the 
sampling errors of the three forms, we see that the two-quarter sales anticipa- 
tions deflated by prices ruling in the period when the expectations are made 
clearly outperforms inventory expectations. Although the percentage change 
variants of (21) and (22) presented in the appendix do not perform strikingly at 
all, a comparison of their unbiased coefficients of correlation indicates a slight 
superiority of the sales change anticipations, thus supporting the results of the 
aggregative tests. An attempt to test a variant of Whitin’s theory, equation 
(23), was not successful."® 

Since one would expect high collinearity between expected inventory stock 
and expected sales, the unequivocal superiority of (22) over (21) requires some 
explanation. However, we have no direct data bearing on this “anomaly” and 
thus can only offer tentative hypotheses. In the first place, it is known that the 
comparatively rapid adjustment of inventory stock to economic changes in 
normal periods in which no bottlenecks occur is not difficult. Entrepreneurs 
may feel that a detailed scrutiny of inventory anticipations is not as warranted 
as a severe scrutiny of the sales variable, upon which much more than inven- 





& T. M. Whitin, The Theory of Inventory Management. Princeton, 1953, p. 32. 
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tory policy is based. This does not imply that inventory anticipations are not 
important in predicting inventory stock (recall that the #* of equation (21) is 
.3638); it does imply that the sales anticipations are more important in the 
profit goal and that there is a non-constant relation-between expected inven- 
tory stock and expected sales. Another reason for the superiority of (22) over 
(21) may be that the observation errors associated with the inventory variable 
generate a large residual error in the relationship between actual and antici- 
pated inventory stock. That is, the well known difficulties in measuring inven- 
tory investment, it being a residual, may cause part of the large residual error. 
Thus anticipated sales tend to predict actual inventory stock better than antici- 
pated inventory stock because, not only are inventory anticipations subject to 
the usual error in evaluating future conditions as are sales anticipations, but in 
addition they are subject to larger measurement error than is the anticipated 
sales variable. 


Ex Ante Multiple Forecasting Forms 
U; 
H* = 1.1884 + .40127 + .2616 >> >) —1 


(.0903)  (.0231) _ 


Ss ie 
§2/S? = 


N; 
— 2.1073 + .8699H + 1.0396 5 (= )e —1 


(.0597) (.0640) 


Si. U; 
1.0643 + .9962 >> (—“) + .1905 © ( 7: 


: i,t—2 . ry 
(.1323) (.0204) m 
5 = 
6*/S? oan 

S; N; 
= — 4942 + 1.5383 >> (= : ) + sD (> -1 


t,t—2 


(.2063) (.1324) 


R2 
5 => 
5?/S? 


, 
S; 


= — .1830 + 80205 ( “) + .5720H*t — 1 
i,t—-2 


(.7444) (.3891) 
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H* = 1.1591 — .0798H7 + 1.5141H*t — 1 
(1982)  (.1573) 


Si 
6146 — .0279H + 2.1803 >> >) —2 (30) 


2840 4283 ‘ a 
( ( ) R? = .8100 


S = .1171 
6°/S? = 1.4587 


The predictory forms (24) and (25) are suggested by the realizations func- 
tions tested in the previous section. Since new orders are defined as current sales 
plus the change in unfilled orders, it is a plausible “oligopoly” variable. Yet the 
functions among which a choice of forecasting ability should be made are (24), 
(25) and (26); and since equation (26) appears to generate nonautocorrelated 
residuals while having a high coefficient of determination and significant pa- 
rameter estimates, it appears to be the best ex ante predictory form which was 
tested. Experiments on the continuous panel in the appendix do not invalidate 
this statement. They tend to show that unfilled orders coupled with either in- 
ventories or sales anticipations are more highly correlated with actual inven- 
tories (all in percentage change form) than the other variables in (25), (27), (28), 
(29) and (30). However, the result in the appendix does not permit us to choose 
between (24) and (26). 

Equations (28) and (29) are distributed lag forms in the explanatory varia- 
bles >> (Sj, '/Piz-2) and H, respectively. They are obtained by means of a 
“Koyck transformation,” which assumes that the parameters can be approxi- 
mated by a geometric series. Although, it cannot claim to be a good ex ante 
predictory equation for our time series,'* the technique of distributed lags has 
potential value as indicated below. 





#L, M. Koyck, Distributed Lags and Investment Analysis, North Holland, 1954, p. 22. We may apply the 
Koyck transformation as follows. Let the infinite distributed lag be 


* 
a) H, = = agus. 
1 


If we assume that the a; can be approximated by a geometric series, a; =Na,(0 <A <1) and (a) becomes 


b) Ht = > ax; 
imi 


i = > ajnix4_2, 
2 
multiplying by \ and subtracting from (b) we obtain 


* * 
ec) H, = 4,1 + Hi 


which we may test by linear least squares. Since a stock variable ie highly autocorrelated, it is important for us to 
test the assumption underlying least squares of nonautocorrelation in the residuals; thus we shall weigh the 6S? 
statistic heavily in evaluating the distributed lag forecasting forms. For a discussion of serial correlation under the 
Koyck transformation, see L. Klein, “The Estimation of Distributed Lags,” Econometrica, October 1958, pp. 
553-65. . 
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Ex Post Simple Forecasting Forms 

H* — .0487 + 1.0037H 1 8101 (31) 

(.1523) = 4.2736 

1155 

2.4694 + 0.3564 > ( sf )e -1 9322 

(.0300) : = 0682 
1.8459 + 0.9637 >> (=) t—1 2= 4554 
(.3150) S= .1940 


H* = .0820 + 1.8843 >> a t—9 2 = .7908 
(.0303) rs S = .1203 

>Hi; = .858 + 1.588 50(S,) 1-2 .6238 
(1.0441) Msn, = 4.6491 

5 = .1817 


Equation (31) is an autoregressive relation involving a one-quarter lag. It is 
presented here for purposes of comparison, since the variable }>; (H./P,)1-1 is 
employed in multiple relations. Equations (32) and (33) employ unfilled orders 
and new orders, respectively. Notice that in the simple forms, unfilled orders 
perform better than new orders—a result which also holds for certain multiple 
relations below and for the percentage change tests in the appendix. Note also 
that the coefficient of + (N,/P.):-1 is barely significant at the .05 level. Equa- 
tions (34) and (35) are the familiar inventory prediction equations, which lag 
sales six months behind inventory stock. The deflated version is superior, al- 
though the size of the coefficient on lagged sales is unexpectedly large. It ap- 
pears that equation (32) is the best of the ex post-simple forms tested; this ap- 
plies to the percentage change test, also. 


Ex Post-Multiple Forecasting Forms 


U; 
H* = 1.0466 + .1862 > >) —1+ 5262 Hy-1 


(.0260) (.0910) 


U; 
6398 + .9804 >> ( t— 1+ .3252H)-» 


(1503) | (.1600) 
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S; N; 
H* = — 4896 + 1.5247>- >)! —2+ sus (>): -1 (38) 


a 


(.1566) (.1926) (.1241) ‘ R? = .9311 

5 = .0693 

52/S? = 1.7702 
= 1.4022 + m3X(F (24 200 D(S*)e-1 (39) 
(.1808) of (0317) R? = .9740 


5 = .0469 
62/S? = 2.1241 


The variable, >°(U;/S,):-1, in (37) is an index of pressure on capacity. Thus 
even though this equation is not the best fit of the ex post-multiple form it sug- 
gests that a capacity pressure variable may be important in influencing the 
level of inventory stock. Future tests of this hypothesis may find that, just as 
in the field of fixed investment, a pressure on capacity variable is operative in 
determining inventory investment in certain special time periods. 

Equations (38) and (39) incorporate the familiar lagged sales variable with 
new orders and unfilled orders, respectively. Equation (39) is the better of the 
two judging by the R? and 62/8? statistics. It is difficult to offer an explanation 
for this finding because no unambiguous theory of the role of new orders and 
unfilled orders in the decision process of the firm is available. 

If a choice of predictory equations of ex post-multiples had to be made, it 
would be between equations (36) and (39). Since there is no significant differ- 
ence between the two in terms of their reliability statistics (this result holds 
also for the percentage change tests), we can suggest that, at the minimum, real 
unfilled orders, coupled with a distributed lag term and/or lagged sales, is use- 
ful in the prediction of inventory stock. 

The question arises as to the amount of variation explained by the ex ante 
variables over that explained by the ex post variables. It is clear that in equa- 
tion (26) there has been a signifieant addition, since #H*: >> (U;/P,):1=.9322, 
whereas the R? of (26) is .9817. Furthermore, Sx*: >>(U:/P,):1=.0682 and S$ 
of (26) is .0036. Yet supplementary evidence from the tests on the continuous 
panel indicate that the increase in variation explained by the addition of the ex 
ante variables is negligible: #? of (32a) is .6882 where the R? of (26a) is .6914. 
In this connection, we note the result of another study which found that “the 
procedure . . . for obtaining a surorgative measure of expectations provides a 
closer prediction of actual inventory behavior (than Railroad Shippers Fore- 
cast Data).”'!7 We may conclude that the Fortune ex ante variables provide only 
marginal gains to the prediction of inventory behavior. However, the anticipa- 
tions data may become more useful in the future as observation error is reduced. 
(See discussion above on the time trend in the weighted mean difference of 
actual and predicted inventory changes). 





17 M. Lovell, op. cit., p. 14. 
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APPENDIX 


Table II contains the results of regressions of manufacturing industries’ 
actual two-quarter percentage change in inventory stocks. Each variable is 
measured in two-quarter percentage change form. The relations are linear. The 
standard errors of the regression coefficients appear beneath each coefficient in 
parentheses. The regression codes refer to a corresponding aggregate estimating 
form used in the text; thus (18’a) has the same variables as (18’) except that 
variables in the former are in percentage change terms. 

For these regressions, we have used either 132 observations (twelve manu- 
facturing industries for 11 quarters) or 44 observations (4 manufacturing in- 
dustries for 11 quarters). The four industries are primary metal, fabricated 
metal, machinery (including electrical), and transportation for which an indus- 
try breakdown of data on new orders and unfilled orders is available. 

These results are discussed in the text. We note here that the results in Table 
II support for the most part the conclusions reached in the text on the basis of 
aggregative tests. However, tests in Table II provide no support to the price 
hedging term and assign very little importance to the ex ante variables in de- 
termining inventory investment. 


TABLE II-A. REGRESSIONS WITH DEPENDENT VARIABLES hj, i.e., 
THE ACTUAL TWO-QUARTER PERCENTAGE CHANGE IN 
INVENTORY STOCK FOR MANUFACTURING INDUSTRIES, 

III Q 1956-I Q 1959 


Regression Coefficients 








p Number of 
8 — 85 ; ; ; Observa- 
tions 








-0442 132 


. 1946 
(.2051) 


. 1284 
(.0800) 


4024 
(.2232) 


- 1662 
(.205) 





- 1066 
(.0795) 


- 2239 
(.2040) 



































EX ANTE AND EX POST INVENTORY DATA 


TABLE II-B. REGRESSIONS WITH DEPENDENT VARIABLE 4h,;, i.c., 
THE ACTUAL TWO-QUARTER PERCENTAGE CHANGE IN 
INVENTORY STOCK FOR MANUFACTURING INDUSTRIES, 

III Q 1956-I Q 1959 


Regression Coefficients 











-0660 
(.1245) 


2881 
(.1556) 


— .0127 
(.0427) 


-2337 
(. 1346) 






































AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


TABLE II-C. REGRESSIONS WITH DEPENDENT VARIABLE A,;, i-e., 
THE ACTUAL TWO-QUARTER PERCENTAGE CHANGE IN 
INVENTORY STOCK FOR MANUFACTURING INDUSTRIES, 

III Q 1956-1 Q 1959 


Regres- 
sion 
Code 


3la 


Regression Coefficients 


1555 | .4361 
.4806) | (.0738) | 
| 


7311 | .5092 
.8516) | (.1224) | 


4411 
. 4219) 


.6242 
.9523) 


1489 | | .2619 
.4600) | (.0524) | 


.2091 
(.0844) 


- 2265 


(.1213) 


.0265 


(.0789) 


Number 
of 


Obser- 


vations 


1017 
(.0655) | 


. 2855 
(.0663) | 


3406 
(.0654) 








. 6882 
} 


| -7185 


.6901 





ON AN INDEX OF QUALITY CHANGE* 


Irma ADELMAN 
Stanford University 
AND 
Zv1 GRILICHES 
University of Chicago 


This paper discusses a multiple-regression chain-index technique for 
the construction of an index of quality change, and illustrates its feasi- 
bility by reference to a price index for automobiles. The interaction of 
our quality index with the usual Laspeyre type price index is studied. 
It is demonstrated that the quality index suggested bears the same re- 
lationship to a constant-satisfaction index as does an ordinary price 
index, and that this index is an appropriate deflator for the CPI. 


1, INTRODUCTION 


HE great concern over rising prices in recent years has intensified interest 
emer economists in the use of price indices as measures of inflation. It is 
generally conceded that observed changes in price reflect not only inflationary 
forces, but also variations in the quality of products available in the market 
place. In the construction of both price and quantity indices, therefore, adjust- 
ments for changes in the quality of items included in the index are sometimes 
made. These corrections, however, are not as systematic as one might desire 


and are rather arbitrary in nature. 

Generally speaking, three approaches to the preblem of adjustment for 
changes in quality are in current use. The first technique is to subdivide each 
commodity into as many separate products as are deemed necessary in order 
to reflect adequately existing quality differentials, and treat each different 
subdivision as a separate commodity in the index. One disadvantage of this 
procedure is that, unless combined with a random sampling method,' the num- 
ber of items which must be included in the index becomes quite large. Further- 
more, this approach aggravates the problems raised by the emergence and dis- 
appearance of certain varieties of products between weight revision periods. 

A second method, involves a combination of specification pricing and splic- 
ing. With this technique, the movements through time of one or several? nar- 
rowly specified varieties of a single product are recorded as long as possible. 
When a variety disappears from the market, another grade is substituted for it. 
The price of the new brand is spliced into the index assuming either price or 
quality to have remained constant. 

The philosophy underlying this approach is well described in the technical 





* This paper is an amalgamation of two independently conceived manuscripts. The theoretical sections were 
written by Mrs. Adelman. The empirical portions are based on work by Zvi Griliches. Mrs. Adelman’s work was 
supported by a Ford Foundation grant. Mr. Griliches’ research was undertaken during his term as Research Asso- 
ciate at the NBER and supported by a grant from the National Science Foundation. We are indebted to Mr. 
M. Nerlove for his helpful comments. 

1 This proposal was discussed in Irma Adelman, “A New Approach to the Construction of Index Numbers,” 
Review of Economics and Statistics, v. XL, No. 3 (Aug. 1958), 240-9. 

? The degree and direction of bias are not independent of whether a single item or the average price of a group 
of items is used. For a discussion of this point see E. V. Hofsten, Price Inderes and Quality Change (Stockholm: 
Bokforlaget Forum, 1952), pp. 50-7. 
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notes explaining the procedures followed by the Bureau of Labor Statistics in 
the construction of the Consumer Price Index.’ 

“The Bureau attempts to price goods of constant quality from period to 
period, so that the index will reflect price changes only. To accomplish this, 
rents are compared on identical units from period to period; for other groups, 
detailed specifications have been written for the item for which prices are ob- 
tained... . 

“Prices are obtained for identical articles as long as they are available in re- 
tail stores. When the Bureau’s representatives can no longer obtain prices for 
a given article, they must substitute another. 

“Substitutions are of two types: (1) Substitution of another article which is 
adequately described by the same specification, and (2) substitution of an 
article serving the same purpose, but not of the same quality, and described by 
a new specification. 

“In the first type, any difference in price between the old and new article is 
shown as a price change in the index calculation. For example, if one brand of 
men’s shirts is no longer available and another brand of substantially the same 
quality is substituted, the difference in price is allowed to affect the level of the 
index. In the second type of substitution, the level of the index is not affected, 
for the new article is introduced by a linking process. An example of this type 
of substitution is the replacement of silk hose by nylon hose during World 
War II.” 

For substitutions of type one, then, the BLS procedure implies (probably 
correctly) that identically specified items are of the same quality. That is to 
say, that the list of specifications used has adequately captured all quality di- 
mensions of the product in question. Therefore, any price variations found be- 
tween identically specified goods are pure price changes and should be allowed 
to influence the value of the index. However, substitutions of this kind are not 
always possible. Furthermore, the line of demarcation between substitutions of 
type one and substitutions of type two is somewhat arbitrary. For the answer 
to the question, “How many specifications must be different before an item is 
‘differently specified’?” is not unique and depends on the observer’s assessment 
of the importance of the nature of the difference in specifications. 

The treatment of substitutions of the second type, is, unfortunately, rather 
open to question. For the splicing procedure used in this case implies that the 
entire price differential between differently specified items is attributable to a 
difference in quality. Consequently, as pointed out by Hofsten,‘ to be correct 
this method requires that the ratio of the prices of the two commodities be 
equal to the ratio of their qualities, as measured by their marginal rates of 
substitution. In a purely competitive market, price quotients between varieties 
of goods available for sale simultaneously in the market would indeed tend to 
adjust themselves so as to ensure the satisfaction of this condition. However, 
since substitutions of type two are made only for products no longer available 
for purchase, no mechanisms exist to guarantee the equality of price quotients 
with marginal rates of substitution even under ideal purely competitive con- 





* “Construction of Consumers’ Price Index, Technical Notes,” Monthly Labor Review, vol. 69, No. 3 (Sept. 
1949), 285-6. 
* Hofsten, op. cit., p. 50. 
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ditions. The “no true price differential” assumption employed in this type of 
splicing procedure is therefore purely arbitrary. 

As pointed out by Ulmer, however, splicing may be legitimate for purely 
competitive commodities available for sale concurrently provided that substi- 
tutions are made at a point in time “at which it appears that for the mass of 
consumers @ choice between the new product and the old are at the prices then 
prevailing has reached a condition approaching indifference.”*® Merely to state 
the requirement, though, is to realize how difficult it would be to satisfy in 
practice. Furthermore, if the conditions enumerated were in fact met, the 
splicing technique would actually be unnecessary. 

The approach recommended by Hofsten® is a variant of the splicing pro- 
cedure. He suggests that whenever quality change takes place during normal 
periods the formula 

) 
P1 


Po 


(1) 


1 
In = — 
g 


should be applied to evaluate the index. In this formula, p{ and p§ are, re- 
spectively, the price of the new good in period 1 and that of the old one in 
period 0, and g is the marginal rate of substitution between the new and old 
products. However, while this method is theoretically correct under the qualifi- 
cations already discussed it is of little help in practice in the absence of con- 
crete procedures for evaluating “g.” 

A third method currently employed for quality adjustments is to change the 
basis of price or quantity quotation from the unit of sale to a yardstick more 
nearly relevant to its intrinsic utility. For instance, the cost of electrical gener- 
ating equipment might be evaluated per unit of capacity,’ the price of milk 
quoted per unit of butterfat content, that of alcoholic beverages in terms of 
alcoholic strength,* etc. This approach represents an attempt to grapple with 
the quality problem in a somewhat more direct fashion than either of the pre- 
ceding methods. As such, it is a step in the right direction. However, even if a 
“correct” yardstick were selected, it is unlikely that the quality of a product 
could be captured in such a unidimensional fashion except in a very limited 
number of cases. And, if more than one quality trait must be considered, the 
problem of choice of weights to be assigned to various quality dimensions must 
be solved. 

A fourth approach which has been suggested (but never applied) is the use 
of chain indices.* For each link in the chain index, a different fixed weight index 
(of the Laspeyre type, for example) is computed. Comparisons over longer 
periods are made by chaining. For instance, if 2;; is a jth year price index with 
year 7 taken as base, then 


os = Wo Wi2" Tos. (2) 





5M. J. Ulmer, The Economic Theory of Cost of Living Index Numbers. New York: Columbia University Press, 
1949, pp. 67-8. 

¢ Hofsten, op. cit., pp. 107-8. 

7 F. de Leeuw, “The Measurement of Quality Change,” Pr dings of the Busi and E. ic Statistics 
Section, American Statistical Association, 1958, pp. 174-83. 

8 R. Stone, Quantity and Price Indexes in National Accounts, Pavis: OEEC, 1956, Ch. 4. 

* F. Divisia, “L’indice monétaire et la théorie de la monnaie,” Revue d' économie Politique, 1925, pp. 842-1004. 
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With this procedure, the period over which the basket is held constant is rather 
short. Therefore, since, under normal conditions, changes in price, quantity, 
and quality are all gradual, the errors introduced into each link of the chain by 
assuming quantity and quality to be fixed are minimized. However, as pointed 
out by Keynes," the cumulation of even small successive errors may result in 
a substantial overall bias over long periods of time if certain persistent trends 
towards say, quality improvement exist in the economy. 

The technique recommended in this paper combines a “direct” method of 
estimating the impact of differences in quality with a chaining procedure. As 
far as we are aware, this procedure was first suggested by A. T. Court in 1939," 
but in a somewhat different context and with very little theoretical justification. 
A more recent and elaborate discussion of this method, in the context of quan- 
tity indexes, was presented by R. Stone in 1956." 

In general, the question one would like to have answered in order to allow 
for changes in quality is, “How much money (ceteris paribus) would have to be 
given to or taken from each consumer in order to compensate him exactly (in 
terms of utility or satisfaction) for the quality changes which have taken place 
since the base period of the index?” This is, of course, a question which has 
neither a unique nor an observable answer. But suppose one asks instead, “How 
much additional money would the average consumer have to pay in the base 
year in order to get a basket of goods identical with the one he purchased in the 
base year, except that the qualities available are those of the given year?” 
Then, as we shall show, a workable index of quality can be constructed. We 
shall prove further, from an examination of basic utility theory, that this index 
is related to the answer to the first question in much the same way as the BLS 
consumer price index is related to the corresponding “ideal” price index. The 
interaction of this quality index with the usual type of Laspeyre price index 
will also be studied. 

The major advantage of an index of this nature is that it could be used in 
conjunction with the BLS consumer price index (or with any other price index, 
for that matter) in order to evaluate the extent of non-quality induced price 
movements as a function of time. The existence of such an index would there- 
fore permit a more realistic appraisal of long term trends in the economy. It 
would also materially assist the economist in his investigation of the system- 
atics of the pace of quality change and its relationship to the business cycle. 


2. QUANTIFICATION OF QUALITY CHANGE 


Before we proceed any further, it would be desirable to define the notion of 
“quality,” as applied to a specific product. Consider, for example, the com- 
modity “milk.” This product is marketed in several grades, each of which dif- 
fers in butterfat content, vitamin content, perishability, etc. While one could, 
in principle, define each grade of milk to be a separate good, such a procedure 
would require us to consider an extremely large number of separate com- 
modities, most of them with only a few observations. 





10 J, M. Keynes, A Treatise on Money. New York: Harcourt, Brace, 1930, v. 1, pp. 118-9. 

u A. T. Court, “Hedonic Price Indexes with Automotive Examples,” in The Dy ics of Automobile Demand. 
New York: General Motors Corporation, 1939, 

2 Stone, op. cit. 
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Therefore, we propose that the quality of a commodity be regarded as a 
composite of a number of different characteristics. Each trait will represent a 
particular quality dimension of the item in question. The entire list of these 
characteristics, plus one indicator of each, constitutes the quality specification 
of the product considered. In the case of milk, for instance, the fat content, 
the vitamin content, and the perishability are some of the quality character- 
istics. The fat and vitamin contents can be indicated by the weight of each item 
per quart, and the perishability can be measured in terms of the maximum 
number of days a quart would remain sweet under some well defined storage 
conditions. Each grade of milk will then be characterized by a particular value 
of each of these indicators. Note that, in this process, the subjective notion of 
the “quality” of a particular type of milk has been quantified by a specific 
corabination of objectively measurable or rankable traits. With this definition 
of quality, to any given commodity 7 there will correspond n; different quality 
dimensions a;;. Naturally, n;, the number of quality specifications will be dif- 
ferent for each commodity, and the value assumed by each a,; will vary with 
time as well as with the grade of the good. 

In order to intercompare different varieties of products, we assume that the 
change in price of the ith commodity dp;, can be decomposed into two distinct 
additive components: 


Opn eg Rtg (3) 

3 Odi; 
On the right hand side of this equation, the first term, dp;, is that price move- 
ment which would have occurred in the absence of quality variations. In the 
remainder of this paper we shall refer to dp!, as the polygenetic price change, 
for obvious reasons. 

The second term represents the combined effect of those price movements 
which are due solely to changes in quality. Note that this measure of “quality 
change” is based only on those “qualities” for which a price is being paid or 
exacted, and only to the extent of the price differential. If these price differen- 
tials are “phony” or “too high” or “too low” from some omniscient point of 
view, the index will not take this into account. In fact, it may not take into 
account some aspects of “quality” which may be important, and incorporate 
other “imaginary” qualities such as brand names whose “superiority” over un- 
branded items would be denied by many people. Thus, if we observe that gar- 
ments bearing one union label sell on the average at a 5 per cent higher price 
than comparable unlabelled items, and also that garments bearing the labels 
of three different unions sell for 15 per cent more than comparable unlabelled 
items, we would predict that if a similar garment were available with two 
union labels, it would probably sell for about 10 per cent more than the un- 
labelled items. And we would use this in calculating our price index (or price 
relative) for the two-label-garment even though we are morally certain (and 





48 Thie approach was first scggested by H. Brems, Product Equilibrium under Monopolistic Competition. Cam- 
bridge, Mass.: Harvard University Press, 1951. 

4 This assumption is similar to that of H. Houthakker, “Compensated Changes in Quantities and Qualities 
Consumed,” Review of Economic Studies, v. 19, (1951-2) pp. 155-64. 
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supported in this by extensive test laboratory findings) that there is no “real” 
quality difference among all these items. We would do this since we are answer- 
ing only the relatively modest question: “What would the price have been if it 
were available?” rather than, “Would the consumers be ‘right’ in paying this 
particular price?”. Once raised, the doubt whether the evidence of the market 
place reflects adequately (if at all) the “true” marginal utility of different items 
or qualities to the consumer, can be turned against any other price or com- 
modity. It is not a problem peculiar to the measurement of “quality.” 

While it is not necessary for our purposes, it would be nice, however, if these 
quality adjustments represented something “real” and not just the idiosyn- 
crasies of manufacturers’ pricing policies. We shall show below that, for con- 
sumers in a state of equilibrium expenditures upon changes in various quality 
characteristics are in proportion to the increase in satisfaction generated. For 
markets in which prices are free to fluctuate, this implies that the equilibrium 
values of 0p;/da;; are “correct” from the point of view of the consumer. For 
markets in which prices are administered, the dp;/da;; may be too high or too 
low relative to their quality content. But this does not matter. For, as shown 
below, the weight with which each quality adjustment is introduced into a 
price index is proportional to the quantity of the good which is purchased. And 
the result of “overpricing” quality differentials will be to lead to a decline in 
consumer purchases of the item in question. 

Since dp;/da;; da;; is precisely the change one would observe in the price of 
the ith commodity if its jth quality index were increased by a small amount 
da; (ceteris paribus), (3) offers a scheme for converting small change of quality 
(which are incommensurable) into corresponding price movements (which are 
commensurable). In order to do this, of course, one must find some way of 
deriving or estimating the dp;/da;;. We suggest for this purpose, a technique 
first used by Court in connection with price adjustments for quality variations" 
and later applied by Stone" to the estimation of quality adjustments to indices 
of quantity. Once a set of indicators a;; for the various quality dimensions has 
been selected, a multiple correlation analysis on cross-section data can be used 
to evaluate the required partial derivatives. For each item (defined by a brand 
name, grade, and/or other distinguishing designation) the input to the analysis 
consists of an average price and a set of values of the appropriate a,;. The par- 
tial regression coefficients of p; on a,; are related to the dp;/da,; in a manner 
which depends on the precise form of the regression equation chosen. If, for 
example, a linear relationship is postulated between p; and the a,;, the partial 
regression coefficients are precisely the dp,;/da;,;. Similarly, if log p; is assumed 
to be linearly related to the a;;, dp;/da;; will be the partial regression coef- 
ficient multiplied by p;. A more complex form can, of course, be used for the 
regression analysis, resulting in a more complicated relationship between the 
regression coefficients and the dp,/da;;. Indeed, there is no requirement that all 
the regression surfaces have the same functional form. There is, in fact, no 
a priori reason to assume that a given p; depends in the same manner on all the 
appropriate a,;. Naturally, only those quality dimensions whose coefficients are 





% Court, op. cit. 
% Stone, op. cit. 
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statistically significant should be retained for the quality index. Thus, a sig- 
nificance test for each 0p;/da;; will provide a cross check upon the choice of 
the ij. 

In view of changes in technology and in consumer tastes the dp;/da,; will, in 
general, tend to vary over time. They should therefore be recomputed periodi- 
cally. 

An example of the derivation of the ép;/da;; is provided by an analysis of the 
prices (new) of 95 different automobile models in 1957 and their relationship to 
selected specifications (dimensions or qualities).'7 After some experimentation, 
the following relationship was found to describe the data best: 


log. P = constant + .117H + .135W + .039L + five other dummy variables 
(.030) (.045) (.013) R? = .966 


where P is the list price of these cars, H is their horsepower (in units of 100), W 
their weight (in 1000 lbs.), L their length (in tens of inches) and the “other 
variables” are dummy variables for V-8 vs. six cylinder engines, hardtops or 
not, inclusion in the price of automatic transmissions, of power steering, and/or 
of power brakes. The numbers in parentheses are the estimated standard errors 
of the coefficients. The dummy variables (variables that take on the value of 
one if an observation falls within a particular class and zero if it does not) were 
introduced to stratify the sample and control for some of the other attributes 
of cars that change with changes in horsepower or size. The contribution of 
each of these dummy variables, except the V-8 one, was statistically signifi- 
cantly different from zero at conventional significance levels. Since the form 
of the equation is semilogarithmic, the coefficients provide us with an estimate 
of dp;/da;;/p;, the percentage change in price due to unit change in a particular 
quality, rather than just of dp/da;;. Thus the estimated coefficients indicate 
that on the average an increase of 10 units in the horsepower of an automobile, 
holding other qualities constant, was associated in 1957 with an 1.17 per cent 
increase in price; that an increase of a hundred pounds in weight, ceteris pari- 
bus, led to a 1.35 per cent increase in price; and that an increase in length by 
one inch, other things equal, resulted in a price increase of 0.39 per cent. 

That these price-quality relationships are not just figments of list pricing is 
indicated by an analysis of the same 1957 automobile models using, however, 
their used prices in 1958 instead of their new (list) 1957 prices. Consumer de- 
mands and tastes are presumably reflected more directly in prices in the used 
car market than in the new, manufacturer determined, list prices. The use of 
used prices in 1958 led to the following estimated equation: 


log. P = constant + .097H + .122W + .055L + five other dummy variables 
(.034)  (.046) (.016) R? = .948 
which is very similar to the one estimated using new (list) prices. The coef- 


ficients of all the dummy variables were significant except for the coefficients 
of the V-8 and power steering variables. No serious bias was apparently intro- 





1? For details and a similar analysis for other years see Zvi Griliches, “Hedonic Price Indexes for Automobiles: 
An Econometric Analysis of Quality Change,” Bureau of the Budget-—NBER Price Statistics Review Committee 
Staff Report No. 3. 
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duced by the use of list prices instead of actual transaction prices in the 
analysis. 

That not all the (dp/da;;)/p’s remain constant over time is illustrated by 
Table 1, which presents additional estimates for 1950 and 1960 derived from 
similar regression equations. It indicates a very substantial fall in the “price” of 
horsepower and length since 1950. 


TABLE 1. ESTIMATED QUALITY WEIGHTS OR ‘PRICES”—(dp/da;;)1/p: 
PERCENTAGE CHANGE IN THE PRICE OF AUTOMOBILES AS THE 
THE RESULT OF A UNIT CHANGE IN SELECTED 
QUALITIES, SELECTED YEARS 








Percentage change in price per 





10-unit change | 100 Ibs. change One inch change 


in horsepower in weight in length 








1950 , ) 1.92 
1957 : : .39 
1960 


From Griliches, op. cit., Table 7. 


3. RELATIONSHIP TO THE CONSUMER PRICE INDEX 


We see, then, that even for as complicated a product as automobiles the 
dp;/da;; can be estimated statistically. Before we can proceed to incorporate 
quality adjustments into an index, however, we must find out in what form and 
with what weights the dp;/da,; should be applied. To do this we start by con- 
sidering a value aggregate 


V = Dd pag (4) 


where 7p; is the price of the ith product and q; its quantity. This value aggre- 
gate, of course, may represent any flow of monetary expenditure or income 
(e.g., consumers’ budgets, the gross national product, etc.). A small change in 
V can be written as:'8 


dV = > pdqit Dd qidpi. (5) 


Replacing dp; by its equivalent from (3), we get'® 


Op; 
dV = : pidqi + : qdp; + >» i >> — daj;. 


§ OAs; 


Dividing both sides of (6) by V =2,q,p: yields 





18 This technique was first applied to Index numbers by F. Divisia, “L’indice monétaire et la théorie dela 
monnaie,” Revue d’ économie politique (1925). 

19 In this derivation dV, dg and dp’ are treated asymmetrically from the quality component inasmuch as the 
first three quantities are evaluated in a purely ex-pvst fashion, without inquiring into the mechanisms of their 
determination. For a different approach, see the excellent paper of F. W. Dresch, “Continuous Index Numbers 
and the Quantitative Study of the General Economy,” Proceedings of the Berkeley Symposium on Mathematical 
Statistics and Probability, University of California Press, 1949, pp. 203-22. 
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DVrdqa DL gdp; 
dV os: i + i + ; i 
Vo Spa D pg D igi 


We now define dV = Vi— Vo, dqi=qit—Qio, €pi’ = Pin — Pio, ANA dari; = dij — aio, 
where the “t” subscripts denote the value of the variable at time ¢ and the zero 
subscripts indicate the value of the same variable during the base period. With 
these definitions, and the appropriate dating of p,, q; and dp;/da;; (7) becomes 


Op; 
> PitQit ) Piodie p Qio( Pit’ ial Pio ) > qio p> (*) (aij — ajo) 
=" +— ope - (8) 
2 P i0Fio p> Pi0dio iu QioPio ) FioPio 


In (8) the first term on the right-hand side is the usual Laspeyre quantity index; 
the second term is a Laspeyre index of polygenetie price change and the third 
term is a Laspeyre quality change index. 

If we multiply (3) (with the above dating) by gio and sum over 7, we get 


’ Opi 
Dd gdp: = DY qiudpi + DY gio dD (- ) dei;. (9) 
a 7 J 0 


ij 


















(7) 

























‘ 













Op; 
DY qudps >> qiodp,’ > a0 > (<*) dai; 
v ] 0 


1 — + i I 0a; 
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Hence, if, in the base period, pj is taken to be equal to pio by definition, 


, Opi 
D> qiopie DY quopie Y gio x( ) dai; 
i ry j 


aij/ 0 
+ 
p> JioPio 


(10) 
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The left-hand side of (11) is an ordinary (consumer or wholesale) price index. 
The first term on the right-hand side is an index of the polygenetic price 
change, while the second term on the right-hand side is our proposed index of 
quality, Q. From (11) it is evident that the price index abstracting from quality 
change may be obtained by taking the difference between the appropriate price 
index (consumer or the wholesale price index) and the relevant quality index. 
Q, of course, measures the changes in the prices of a certain basket of goods 
induced, under base period conditions, by differences in quality. In this index, 
quality variations are weighted not only by the movement in price resulting 
from a unit change in a;; but also by the weight of the 7th product in the ordi- 
nary price index. 

It should be noted that in view of the fact that (3) applies exactly only to 
small changes in V, both (8) and (11) are precise expressions only for small 


AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


variations dp’, dq and da,;. In normal times, however, movements in price, 
quantity and quality all tend to be gradual. Therefore, the additive decompo- 
sition of observed price changes into a polygenetic price index and into a quality 
index given in (11) and the analogous separation of changes in V expressed in 
(8) are good approximations for short time steps. To enable accurate long term 
comparisons, we suggest that all the separate indices in (8) and (11) take the 
form of continuous chain-indices. However, if resource limitations do not per- 
mit this approach, both (8) and (11) can still be used as approximations. These 
approximations become less valid, though, the longer the time elapsed between 
weight revisions in each index. 

If it is merely the polygenetic price index which is of interest, it is, of course, 
not necessary to construct the quality change index Q explicitly. The informa- 
tion embodied in Q can be used to correct the price quotations employed in the 
computation of the CPI or WPI. For, (11) can also be written as 


’ Opi 
> QioPit >: qio |p = x( ) (avije sn a) | 
F By ; i \dais/o 





7, QioPio rw ys JioPio 


The second terms in the bracket of the right-hand side, which represents the 
cost of changes in the quality of the 7th good at base period prices, is the re- 
quired adjustment term. 

In the opinion of the authors, however, the availability of a separate index 
of quality would provide the user of index numbers with more information and 
with a greater degree of flexibility. Furthermore, if quality adjustments were 
applied to a large number of products, the construction of a separate quality 
index for the entire economy requires very little additional work than that al- 
ready implied by the adjustment technique itself. Therefore, it would appear 
that the index Q ought to be evaluated explicitly along with the other BLS 
indices. 

The possible magnitude and importance of applying adjustments for quality 
change consistent with the second right-hand term of (12) can be evaluated 
by reference to the previously described analysis of automobile, price-quality 
relationships. Table 2 presents the changes that occurred since 1950 in the 
specifications of the average “low-priced three” 4-door passenger sedan, the 


TABLE 2. CHANGES IN THE SPECIFICATIONS OF THE AVERAGE 
“LOW-PRICED THREE” CAR—dai; 








Change in Change in | Change in 
horsepower weight, lbs. | length, inches 





1950-56 (6 cyl.) 41 73 3.6 
1956-60 (V-8) 14 369 11.8 








Average for three Chevrolet, three Ford, and two (the two lower priced series) Plymouth models. See Griliches, 
op. cit., Table 6. 
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TABLE 3. OFFICIAL AUTOMOBILE PRICE INDEXES, QUALITY CHANGE 
ADJUSTMENTS USING ESTIMATED 1957 QUALITY CHANGE WEIGHTS, 
AND THE ESTIMATE POLYGENETIC PRICE INDEX. PERCENTAGE 
CHANGES: 1950-56, 1956-60, AND 1950-60 








Quality Change Adjustment— Polygenetic Index 
op dai; 


CPI 
Period New 
(model) | Automobile Op (aize—axigzo) 
Price Do “Gay Po 


Horse- 
Inde Tat 
ndex power Weight | Length 


daz —p 








15.86! 4.51 
1956-60 14.16? 1.68 
1950-60 32.26 




















* Computing by linking the 1950-56 ard 1956-60 estimates. 
1 March 1950 to November 1955. 
2 November 1955 to November 1959. 


type of car priced by the CPI.*° Table 3 uses these changes together with 
estimated 1957 (dp/da;;)/p to adjust the published CPI automobile price 
index for quality change. The resulting polygenetic index of automobile price 
change (1950-1960) is presented in the last column of Table 3 indicating a 
price rise only one-third as large as the one indicated by the official automobile 
price index. If we had used the substantially higher 1950 quality change 
weights, the polygenetic index would have actually shown a 23 per cent fall in 
the price of automobiles since 1950. If we had used the somewhat lower 1960 
weights, our polygenetic index would have shown a rise of about 16 per cent, 
still less than half of the observed rise in the official index. 

Thus, quality change appears to have played a very important role in the 
pricing of automobiles. If, as is quite likely, the importance of quality change 
for other commodities is anywhere near the same order of magnitude, the impli- 
cations for the measurement of changes in the general price level and for policy 
conclusions drawn from them are obvious and extremely important. Even if 
we adjust the overall CPI just for the quality change in one commodity—auto- 
mobiles, using 1950 weights—we would reduce the index from 125.6 in No- 
vember 1959 (1947-49—100) to 123.4 eliminating close to 9 per cent of the 
overall price rise since 1947—49.”' It is not at all improbable that correctly meas- 
ured there has been no real rise in the general price level slnce 1952.” 





2° Sixes before 1956 and V-8’s since. This shift in specifications is made to conform with the CPI shift to the 
pricing of V-8's in 1956. 

21 1950 quality weights are appropriate to use in conjunction with a fixed weight, 1950 expenditure survey 
based CPI. The automobile index was assumed to have a weight of 3.7 per cent in the CPI, the weight of all new 
and used automobiles in 1950. This procedure essentially ascribes the same quality change estimate to used cars 
as to the new ones. 

2 The attention that has been paid by policy makers to this imperfect measure of price change (the CPI) 
has perhaps been too close and may have led them into error. If, as we believe, there may not have been any real 
“new inflation,” then the restrictive monetary policy that was provoked by the apparent changes in the price 
indexes was really unnecessary. “If” the Fed had not watched the CPI so closely, it might not have tightened so 
much, and we may not have had to experience the 1960 recession. “If” unemployment in 1960 would have been 
somewhat lower, the Republicans might have won the election instead of losing it because they could not take 
their eyes off the wrong indexes. 
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4. THE QUALITY INDEX AND CONSUMER PREFERENCES 


In order to get a better idea of the relevance and limitations of the quality 
adjustments proposed, we shall now explore the relationship between the 
quality index Q and a “constant satisfaction” index of quality. We shall show 
that the quality index proposed bears the same relationship to an index meas- 
uring the change in satisfaction due to quality change as the usual consumer 
price index bears to a constant utility price index. 

We start by postulating the existence of an index of utility function 


U= U(q, * * © 5 Un; B11, Biz, * * * » Mim; M21, M22, * °°, 
Gow; * * * Qnty Ana, * * * » Gas); (13) 


into which the quality characteristics of each product have been introduced 
explicitly.% As usual, we assume that the consumer maximizes his utility sub- 
ject to the budget constraint, 


Le Pig = Y; (14) 


where y is the consumer’s income, and we form the Lagrange function 


R= U(4, aij) + a(  ® Piqi — ). (15) 


In order to explore the logical relation of the weights in the index Q to con- 
sumer preferences we shall now introduce two simplifying assumptions. These 
hypotheses are appropriate in many situations but do not hold in a number of 
practical cases one can construct. Nevertheless, their inclusion in this section 
is for convenience in the exploration of the conceptual properties of the quality 
index, and the conclusions that will be drawn will be suggestive, though not 
rigorous, for situations in which the hypotheses are not satisfied. 

First we assume that quality is a continuous variable. Secondly, we shall 
restrict one analysis to situations in which 

oe 0. (16) 
0ai; 

This latter postulate implies that a typical consumer feels that there is no 
direct substitution between quantity and quality. It holds strictly for some 
products (e.g., most consumer durables of which, in general, only one item is 
bought at a given time), and is probably not a bad approximation for many 
others, although it is obviously a poor approximation for an item such as 
women’s stockings. This assumption does not imply that the quantity of an 
item purchased is independent of its quality. On the contrary, since quality 
has a price tag attached to it the quantities of each item consumed depend ex- 
plicitly on its quality, as will become apparent below. 

Setting the partial derivatives of R with respect to the a,;; and the g; equal 
to zero yields the condition that, for each a;;, 





*% This procedure was also adopted in Houthakker, op. cit., p. 156. 
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and for every q; 
(18) 


Equations (17) and (18) can be solved simultaneously to determine the equi- 
librium values of the a;; and the qi. 

Since \ is the marginal utility of income (17) states that, in equilibrium, the 
marginal utility of a given quality dimension must be equal to the marginal 
utility of the income spent to acquire the increment in a,;. For the mth and the 
nth quality dimensions of the same commodity 7, then, we have from (17) 


Op; 
OQim 
Op; 


Odin 





(19) 


That is, the marginal rate of substitution between quality characteristics of the 
same commodity must be proportional to the ratio of their “quality prices.” 

If we now compare the right-hand side of (19) to the index Q, it becomes 
apparent that, in equilibrium, the relative weights in the index of different 


quality dimensions of a given product are equal to the ratios of the increments 
in satisfaction to which they give rise. 

Similarly, for the mth quality of the rth good with the nth quality of the sth 
good, one may write from (17), 


aU 
OQrm 


aU 


OQsn 


(20) 


Since the right-hand side of (20) is precisely the ratio of the weights of dar, 
and da,, in the index Q, the ratio of the weights of changes in the quality 
characteristics of two different products is, in equilibrium, just the marginal 
rate of substitution between their respective quality dimensions. Thus, for a 
consumer in equilibrium the index Q weights changes in various quality char- 
acteristics in proportion to the increase in satisfaction generated. 

Suppose now that the ratio of the marginal rates of substitution between 
Grm and a, is smaller than the ratio of the costs exacted for these quality 
changes. Under non-administered price situations, both dp;/da,; and consumer 
purchases will tend to adjust until (20) is satisfied and equilibrium with con- 
sumer satisfaction is reached. If prices are administered, and therefore not free 
to vary, equilibrium between the marginal rates of substitution and the cost 
of the quality dimensions will be reached by a decline in the ratio of q, to q. 
Although under these circumstances 
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OPrm / ODen 


Orn Odisn 


will be “too high,” the adjustments introduced in the quality index will still be 
representative of changes in consumer satisfaction provided the chain-index 
approach recommended is used. For, as we saw above, in the quality index Q 
the dp;/da,; are weighted by the quantities of the item purchased. 

In the construction of the ordinary consumer price index, one finds the 
analogous relationship between the weighting of price changes and consumer 
satisfaction. Therefore, the proposed index of quality Q provides an appropri- 
ate deflation for a “constant satisfaction” consumer price index. 

A knottier problem arises where a new quality variation is introduced ac- 
companied by an advertising campaign which successfully alters tastes. For, 
under these circumstances, it is not clear whether the original or the new 
preference patterns should be used to evaluate the quality index. However, 
this problem arises in essentially the same form with respect to the consumer 
price index as well. While it remains unsolved, the impact of changes in tastes 
is minimized by the use of continuous chain indices. 

Finally, it should be noted that the approach recommended here is valid 
primarily for small changes in quality. The impact of truly discontinuous tech- 
nological breakthroughs can only be evaluated approximately, by an attempt 
to approach this abrupt change through a linking procedure involving a num- 
ber of intermediate steps. However, in normal times, truly discrete changes in 


quality are quite few even in as technologically dynamic an economy as that of 
the United States. 





LENGTH OF CONFIDENCE INTERVALS* 


Joun W. Pratt 
Harvard University 


The expected length of a confidence interval is shown to equal the 
integral over false values of the probability each false value is included. 
Thus two desiderata for choosing among confidence procedures lead 
to the same measure of desirability. Furthermore, by common defini- 
tions of “optimum,” a procedure is optimum as regards including false 
values if and only if it is optimum as regards expected length. How- 
ever, the procedure with minimum expected length ordinarily depends 
on the true value of the parameter. The possibility is explored of mini- 
mizing the average expected length, averaging according to some 
weighting on the possible parameter values. (This is not the same as 
assuming a prior distribution and using Bayes’ Theorem.) The ideas 
are applied to the mean and variance of a normal distribution and the 
probability of success in binomial trials. 


INTRODUCTION AND SUMMARY 


O MEASURE the desirability of a confidence interval procedure, one would 

like to measure in some way the extent to which the interval includes false 
values. Since at most one point of the interval is the true value, one natural 
measure is the length of the interval (if it is fixed) or the expected length (if it 
is random; assume that both end-points are finite). That is, the expected 
length of the interval is a measure of the “average extent” of the false values 
included. 

There is another approach to the problem. A natural measure of the extent 
to which the confidence interval procedure includes a particular false value is 
the probability of including that particular value. To “average” this over all 
false values, one might simply integrate it over the false values. This gives an 
apparently different measure of the “average extent” of the false values in- 
cluded. 

It turns out, however, that the two measures are equal. ‘l’o be more specific, 
let L<r<U be a confidence interval for a parameter r. The expected length of 
the interval is E{ U—L}. The probability that the interval includes the false 
value rt is P} L<7r<U }. The fact stated above is that integrating this proba- 
bility over r (omitting the true value ro) gives exactly the expected length: 


E{U - L} -f P{L <1 < U}dr. 
T#TO 


In Section 1 we draw some conclusions from this fact, after stating it more 
carefully and proving it. The well-known relation of confidence intervals to 
tests permits another interpretation of expected length (Section 2) and shows 
how to minimize it (Section 3). However, the minimizing procedure generally 
depends on the true value of the parameter. In Section 4, accordingly, the ex- 





* This research has been supported by the United States Navy through the Office of Naval Research, under 
contract Nonr-1866(37). Reproduction in whole or in part is permitted for any purpose of the United States Govern- 
ment. 
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pected length is integrated over the possible values of the parameter, and the 
resulting “average” expected length is minimized. Sections 5-9 deal with some 
examples. The mean of the normal distribution is discussed in Section 5. In 
binomial trials (Section 6) one is led to procedures previously suggested by 
Sterne [7] and Crow [4]. The corresponding randomized procedures may con- 
sist of two disjoint intervals. This pathology is discussed briefly in Section 7. 
Section 8 is concerned with the variance of the normal distribution, and ilius- 
trates the behavior of invariant procedures in the general context of this paper. 
Section 9 brings out the relation between the procedures of Section 8 and Bayes 
procedures. A generalization given in Section 10 covers one-sided problems, 
which are discussed in Section 11. The concluding remarks (Section 12) depend 
for intelligibility only slightly on Sections 1-11. 


1. EXPECTED SIZE AND COVERING FALSE VALUES 


Let X be the observed random variable, and suppose R(x) is a confidence 
region for a “parameter” r. (X need not be restricted in any way: it may be a 
single observation or a sample from a univariate or multivariate distribution 
or from a distribution on an arbitrary space. The problem need not be “para- 
metric” : for instance, the distribution sampled might be an arbitrary univariate 
distribution and 7 its median.) 

Suppose »u(2) is a measure of the size of the region R. The discussion below 
may be read at any of the following levels of generality. 

A. + is one-dimensional, R(X) is an interval with end-points L(X) and U(X), 
and u(R) =U —L, the length of R. Then du(r) below may be interpreted as dr, 
so Srdp(r) = Sidr =U—L=nz(R). 

B. r is one-dimensional, FR is a union of (one or more) disjoint intervals, 
u(R) is the sum of their lengths, and du(r) =dr. 

C. r=(1, - ++, te) is dimensional, R is any region in k-space, u(R) is the 
k-dimensional volume of R, and du(r)=dr, + - - dt. 

D. + is k-dimensional, R is any region in k-space, u(R)=frW(n,---, 
ty)dry + + - dry, and du(r)=W(n, ---, te)d71 + - - drx, where W is « non-nega- 
tive function. 

E. 7 is arbitrary, R is any region in r-space, and uz is a o-finite, non-atomic 
measure on r-space. (Allowing atoms would change no essential conclusions of 
this paper, but would require the addition of another term on the right-hand 
side of (3) and similar equations below.) 

E includes A—D. Discussion of measurability in C—E will be omitted. 

In each case, 


u(R) -f du(r). (1) 
TER 


Since R(X) is a random region, its size u(R(X)) is also random in general: 
hence, if size is the criterion, it is natural to consider the expected size. This 
turns out to be the probability that R(X) includes r integrated over the possible 
values of 7, that is 


B{uR(X))} = f PURO) Dd r}duCr. (2) 
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The expectation on the left and the probability on the right must be computed 
under the same distribution of X. 

To prove equation (2), simply substitute R(x) in equation (1), integrate over 
the distribution P of X, and reverse the order of integration, thus: 


EURO) = ff dura) 
rE R(z) 


-ff aP(2)du(s) = f P{R(X)Dr}du(, Q.E.D. 
R(z)3r 


Now the omission of a single value of 7 from the range of integration on the 
right-hand side of equation (2) will not change the value of the integral. There- 
fore, 


B{u(ROX))} =f PLR) Dd +}du(r. (3) 


THT 


If ro is the true value of 7, this says the expected size is the probability that 
R(X) includes 7 integrated over the false values of r. 

We shall see that (except under pathological circumstances), for each dis- 
tribution P of X, there is a confidence procedure which, for every 7, minimizes 
the probability under P of covering r. By (2) or (3), such a procedure has mini- 
mum expected size under P for every measure u of size. Conversely, for any 
particular measure yu of size, if R has minimum, and finite, expected size under 
P, then R uniformly minimizes the probability under P of covering false values 
(except possibly for a set of false values of size 0, that is, of u-measure 0), The 
same statements hold if attention is restricted throughout to a suitable particu- 
lar class of procedures (for instance, to “unbiased” procedures; see the next-to- 
last paragraph of Section 3). Thus, if there is an “optimum” procedure as re- 
gards including false values, it is also “optimum” as regards expected length, 
and vice versa. 


2. EXPECTED SIZE AND TYPE II ERROR 


Confidence regions and families of tests are associated by the relation given, 
for instance, by Lehmann [5, p. 174], 


R(x) S>r7 ifandonly if x € A(z). (4) 


Here A(r) is an acceptance region for the null hypothesis that 7 is the true 
value. If a confidence procedure R(x) is given, equation (4) defines an accept- 
ance region A(r) for each 7; conversely, if an acceptance region A(r) is given 
for each r, equation (4) defines a confidence procedure R(x). For any 7 and any 
distribution P of X, (4) implies 


P{R(X) Dr} = P{X E A(n)}. (5) 
Thus, in particular, the probability under P that the confidence region R(X) 


will include the false value + equals the type II error probability at P of the 
associated test of the null hypothesis that r is the true value. Substituting (5) 
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in (2) and (3) gives another interpretation of the expected size: 


B{u(R(X))} = f PIX € AG) }duG) 


-{ P{X E A(r)} du(r). 


TT 
3. MINIMIZING EXPECTED SIZE 


It is now clear how to minimize the expected size under P for any particular 
distribution P of X. For each 7, choose a test of the null hypothesis that r is 
the true value which is most powerful against the alternative P. The associated 
confidence region has minimum expected size under P, because it minimizes 
the integrand of equation (6), at each r. 

A most powerful test exists except in pathological circumstances [5, p. 81]. 
It can be found by the Neyman-Pearson Lemma [5, p. 65] if r completely spec- 
ifies the distribution of X. Specifically, if X has density or frequency function 
f under P and f, under r, a most powerful test is 


reject " - I(X) Ann 
f(X) ( <e(r), 


where c(r) and the rejection rule at X’s where f(X)/f,(X) =c(r) are chosen to 
give the test the required level at each r. The confidence procedure with mini- 
mum expected size under f is that corresponding to (7), which includes or ex- 
cludes 7 according as (7) accepts or rejects r. 

If P is a uniform distribution, then f(z) is constant and (7) becomes 


reject r if f(X) <e,(r), 
acceptr if f,(X) > e,(r), 


(7) 


accept r 


(8) 


where ¢,(r) and the rule at equality are chosen to give the required level. This 
is an old (and perhaps old-fashioned) rule for choosing critical regions. 

If P satisfies the null hypothesis that r» (say) is the true value, the power 
against P and the size of a test of the null hypothesis that ro is the true value are 
equal. One may select the test arbitrarily for this one value of r, and indeed for 
any finite (or even denumerable) number of values of 7, since this will not affect 
(2) or (6). 

In a particular problem, there is ordinarily a class of possible distributions of 
X. Nothing so far has required that P belong to that class. Even if this is re- 
quired, however, under most assumptions about the class of possible distribu- 
tions, the most powerful test against P and hence the confidence procedure 
with minimum expected size under P and minimum probability under P of 
including false values will not be the same for every possible distribution P. 

One possibility is to take some kind of average over the possible distributions 
P of the expected size under P. This is discussed in the next section. 

Another possibility is to restrict consideration to unbiased confidence regions, 
that is, regions for which the probability of including a false value is less than 
or equal to the probability of including the true value. By (5), a confidence 
region is unbiased if and only if the corresponding tests are all unbiased [5, p. 
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177]. Sometimes there is a uniformly most powerful unbiased test of the null 
hypothesis that r is the true value for each r. Then the associated confidence 
procedure will, among unbiased procedures, uniformly minimize the probabil- 
ity of including false values and the expected size of the confidence region for 
every possible distribution P and every measure u of size, that is, uniformly in 
P and uz. 

Another possibility in some problems is to restrict consideration to invariant 
confidence procedures [5, p. 243]. By (5), a confidence procedure is invariant if 
and only if the corresponding family of tests is an invariant family. Sometimes 
there is a uniformly most powerful invariant test of the null hypothesis that r 
is the true value for each 7 and these tests form an invariant family. Then the 
associated confidence procedure will, among invariant procedures, minimize 
the probability of including false values and the expected size of the confidence 
region for every possible distribution P and every measure yu of size. For a 
particular P, possible or not, except in pathological circumstances, there is for 
each 7 a most powerful test against P among invariant tests of the null hy- 
pothesis that 7 is the true value. If these tests form an invariant family, then 
the associated confidence procedure will, among invariant procedures, mini- 
mize the probability under P of including false values and the expected size 
under P of the confidence region for the particular P and every measure yu of 
size. If the most powerful invariant tests against P do not form an invariant 
family, then no invariant family minimizes the integrand of equation (6) at 
each 7, and a different method of minimizing is needed. The minimizing pro- 
cedure may, in fact, depend on the measure of size used. This will be illustrated 
(Section 8), but not discussed in general because considerable machinery would 
be required. 


4, AVERAGE EXPECTED SIZE 


One might, instead of invoking unbiasedness or invariance, average the ex- 
pected size under the different possible distributions P according to some sys- 
tem of weights. Specifically, let @ index the possible distributions p, of X, and 
consider the “average” expected size [E»{u(R(X))}dv(0), where dv(@) may be 
interpreted for one-dimensional @ as d@ (compare A and B, p. 550), for m- 
dimensional @ as dé,--- d0, (compare C) or W(j,- ++, O0m)d0:--- dOn 
with W>0 (compare D), or vy may be a general measure, even one with atoms 
(compare E). Substituting P, for P and r(@) for ro in equation (3) and integrat- 
ing gives 


f Btuceamjano = ff PAR) S r}dniro. ——) 
Ter( 


This says the average expected size is the probability of including a false value 
integrated over false values and over the possible values of 86. 
Integrating (2) and substituting (5) gives 


f Btuean}a@ = ff Pitz d duran 


“ f Ps{X E A(r)}dv(0)du(r). 
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Let P be the “distribution” obtained by “mixing” the distributions Ps» ac- 
cording to the weighting v. In other words, let P be the marginal “distribution” 
of X when @ has marginal “distribution” v and X has conditional distribution 
P, given 6. (P has total weight 1 and hence is a true probability distribution 
only if the same holds for ».) Let FE be the “expectation” corresponding to P. 
Then (2) is exactly the first equality of (10) and the first equality of (6) is 
exactly the second equality of (10). 

The average expected size is minimized by the confidence procedure associ- 
ated with a family of tests A(r) such that, for each r, among tests of the null 
hypothesis that 7 is the true value, A(r) is a most powerful test against the al- 
ternative “distribution” P just described; that is, the acceptance region A(r) 
minimizes 


P{X € A(r)} = [ PAX E AMO, (11) 


This is just like the situation of Section 3 except that P may not be a true 
probability distribution. A minimizing A(r) exists except in pathological cir- 
cumstances. It can be found by the Neyman-Pearson Lemma when the null 
hypothesis that 7 is the true value is simple. Specifically, if 7(@) =@ and X has a 
density or frequency function f(x), a minimizing procedure is: 


ef PHO 
reject | ie —— {rer (12) 


f(X) 


where c(r) and the rejection rule when equality holds are chosen to give the 
procedure the required level at each r. 

Even in an apparently reasonable problem, it can turn out that the minimiz- 
ing region is not always an interval (Section 6). 

The minimizing procedure for a particular weighting v may have infinite 
average expected size for some measure yu of size. Then the average expected 
size of every procedure is infinite for this 4. However, the minimizing procedure 
for the weighting » minimizes the average expected size for all yp, including 
trivially those » for which the minimum is infinite. It will also nearly minimize 
the average expected size for a weighting v’ near v, and the minimum for a par- 
ticular » may be finite under v’ though infinite under ». 

Infinite minima arise most naturally when the total weight of » is infinite, 
but the preceding paragraph applies even if the total weight is finite, and even 
(with trivial alterations of terminology) to Section 3. 


accept r <c(r), 


5. EXAMPLE. MEAN OF A NORMAL DISTRIBUTION 


If X has a normal distribution with a known variance of 1, the usual level 
1—a confidence interval for the mean r is 


X — fap St SX + kay, (13) 


where £./2 is the upper a/2-point of a normal distribution with mean 0 and 


variance 1. The length of this interval is constant at 2£4,/. 
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We will now find the procedure with minimum expected length when the 
mean is 0. For 740, a most powerful test of the null hypothesis r against the 

alternative 0 accepts 7 for 
X<rt+ésh if 7r<0; 


i (14) 
X>r-& if r>0. 


At r=0, any test may be used; for convenience, let all x be in the acceptance 
region for the null hypothesis r =0. The interval of all s which would be accepted 
by the above tests minimizes the expected length when the mean is 0. This 
interval is (see Figure 1) 

X—-%<7r<0 if XS —8,; 

X-t S7rsXt+éa if —t, < X < &; 


O<7rsxX+i& if &<X. 


This can be written min{0, X—£.} <r<max {0, X+é,}. 
Although this interval may be arbitrarily long, when the mean is 0 it will 
have length 2&4 with probability 1—2a and expected length 


2[(1 — a)ée + (2x)-1ete"/8], 


Table 1 shows how much is gained in terms of expected length, when the mean 
is 0, by using (15) instead of (13). The last line gives the efficiency of (13) with 
respect to (15) in the sense of the ratio of the sample sizes needed to achieve 
the same expected length when the mean is 0. 


TABLE 1. COMPARISON OF TWO CONFIDENCE INTERVALS 
FOR A NORMAL MEAN 





Confidence level, 1—a : .90 95 .98 .99 .999 








Expected length of interval (15) | 

when the mean is 0 1.907 2.658 3.332 4.122 4.659 6.181 
Length of the usual interval 2.563 3.290 3.920 4.653 5.152 6.581 
Ratio squared .55 .65 42 .79 .82 .88 








Of course, if the mean is not 0, interval (15) will not have minimum expected 
length, and if the mean is far from 0, (15) will have greater expected length 
than the usual procedure (13). I am not advocating the use of interval (15). 
My real objection to (15) cannot be stated in the present framewerk, but is indi- 
cated in Section 12. 

The usual procedure (13) is of the form (8) and hence minimizes the expected 
length when X is uniform on (— ©, ©), that is, it minimizes the ordinary (un- 
weighted) integral over x of the length as a function of z. 

The usual procedure also minimizes the ordinary integral over 7 of the ex- 
pected iength when the mean is r. This follows immediately from (12) and 
amounts to the statement of the previous paragraph, since X is uniform if 7 is 
uniform. It also follows from the fact that the usual two-sided equal-tailed 
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Normal distribution, mean r, variance 1. 

Horizontal section of shaded region is acceptance region of most powerful test against 
alternative 7 =0. 

Vertical section is confidence interval with minimum expected length when 7r=0. 


The dotted lines bound 
the corresponding region for \\ 
the usual two-sided test and 


interval. 
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Fia. 1. A family of tests and the corresponding confidence procedure for the mean 
of a normal distribution. 





test is uniformly most powerful invariant (under the transformation of x carry- 
ing x—7 into r—2) and hence minimizes (10) for dv(@) =dé. 

The previous two paragraphs apply, of course, to any other measure of size 
of the sort considered in this paper. 


6. EXAMPLE. PROBABILITY OF SUCCESS IN BINOMIAL TRIALS 


Let S be binomially distributed with parameters n, p. Four confidence pro- 
cedures for p have been tabled. Clopper and Pearson’s graphs [3] and most 
tables are based on tests for each possible value of p having probability at most 
a/2 in each tail under the null hypothesis. Tables of the unbiased confidence 
intervals, which are randomized (see below) uave recently been prepared by 
Blyth and Hutchinson [1]. 

Sterne’s rule [7] is to take the smallest interval containiag the region cor- 
responding to tests of the form (8). If this region is an interval for each possible 
value of S, it follows Sterne’s procedure minimizes the expected length of the 
confidence interval when all values of S are equally likely. This expected length 
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is just the arithmetic average of the lengths of the intervals (one for each pos- 
sible value of S), and will be called the “average length” below. 

Crow [4] notes that, for some values of n, a, and S, the regions given here 
by (8) are not intervals, and he gives a geometric argument amounting to the 
relevant case of the first equality of (6) to show that the average length will be 
minimized by any set of confidence intervals corresponding to tests which 
minimize, for each p, the number of values of S in the acceptance region A (p). 
In the cases he tabled, Crow was able to choose acceptance regions containing a 
minimum number of values of S so that the corresponding confidence regions 
were always intervals. 

Thus Crow’s tables, and probably Sterne’s also, minimize the average length 
of the interval. Crow also points out that this amounts to minimizing the ex- 
pected length when p has a uniform “prior” distribution, since then all values 
of S are equally likely. 

A test or confidence procedure could depend, at least in part, on a random 
variable drawn independently of the observations from a known distribution; 
if it does not, it is “non-randomized.” Crow’s tables, and probably Sterne’s 
also, minimize the average length among non-randomized procedures. If one 
minimizes the average size among all procedures, there are only a finite number 
of values of p at which there is a choice of acceptance region, and the minimiz- 
ing confidence regions are not always intervals, as we shall see. 

An easy way to incorporate the possibility of randomization into the discus- 
sion is to include with the original observations one observation on a random 
variable U uniformly distributed between 0 and 1. In any problem, randomized 
procedures based on the original observations Y are equivalent to non-random- 
ized procedures based on X =(Y, U). 

In the binomial situation at hand, we can even let X =S+U; then S (the 
binomial variable) and U are the integral and fractional parts of X, respec- 
tively. 

If S is uniform on 0, 1, - - -, nm, where n is the number of binomial trials, 
then X = S+U is uniform on (0, n+1), and the first equality of (6) becomes 


Biron} = ff oo (16) 


This is minimized if and only if A(p) is as short as possible for every p (except 
possibly for a set of p’s of u-measure 0). Here it happens the minimizing A (p) 
can always be taken to be an interval. 

Figure 2 gives these shortest acceptance regions for n=5, a=.10. The cor- 
responding confidence regions are not always intervals, though they have mini- 
mum average size (for any measure of size). To illustrate the calculation, for 
.317<p<.401, 1 and 2 are both more likely than 3, 3 is more likely than 0, 0 
than 4, and 4 than 5. Hence 1, 2, and 3 would go into the acceptance region 
first. We have the following probabilities: 

.32 .34 .36 .38 -40 


p 
P,{S=1, 2, or3} 81560 82613 83284 .83583 .83520 
P,{S=0} . 14539 12523 10737 09161 07776 
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p =binomial parameter a=.10 
S =[X] is binomial (n =5) Horizontal section is shortest acceptance 
U =X —[X] is independent of X and uni- region. 
form on (0, 1] Vertical section is corresponding confidence 
region. 
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Fia. 2. A family of tests and the corresponding confidence procedure 
for the binomial parameter. 


Thus at p=.32, for instance, S=1, 2, and 3 are in the acceptance region; 
further probability of .9—.81560=.08440 is needed. This is obtained by ac- 
cepting at S=0 with probability .08440/.14539 = .581, say by accepting when 
U>.419. Similarly at p=.40, further probability of .06480 must be obtained 
by accepting at S=0 with probability .06480/.07776 = .833; thus it is impossi- 
ble that the corresponding confidence region at S=0 should always be an inter- 
val—there must be probability at least .833—.581=.252 that .40 is in the 
region but .32 is not. 


7. DISCONNECTED REGIONS 


If the failure of the foregoing region to be an interval is regarded as a draw- 
back, it must be held equally against minimizing expected length and against 
minimizing the probability of covering false values since these principles 
amount to the same thing, as it has been the main purpose of this paper to 
show. If one judges principles not by prior reasonableness but by how good the 
resulting procedures are (How does one judge this “goodness” without some 
prior values? One must avoid the fallacy of over-naive pragmatism.), one may 
feel this failure is a strike against integrating (2) and (3) (pp. 550-1) to get (9) 
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and (10), and that it would be preferable to invoke some other principle. Un- 
biasedness, for instance, leads to randomized confidence regions for the bino- 
mial parameter which are intervals [5, p. 179], but different from even the ran- 
domized form of the intervals of [3], [4], or [7]. (Unbiasedness here, as so 
often, picks out a single procedure from a class of “good” procedures, though 
few find it compelling in itself.) 

Insomuch as integrating (2) and (3) to get (9) and (10) seems Bayesian, this 
may seem a strike against Bayesianism. It is not. A Bayesian would be in- 
terested in a posterior probability region, not a confidence region. If the poste- 
rior distribution were not unimodal, the smallest region of posterior probability 
.90 might not be an interval, but the Bayesian would not find this objection- 
able and would feel that it was just an attribute of the particular problem. In 
the binomial problem, taking p a priori uniformly distributed between 0 and 1 
(corresponding to the weighting used in Section 6), the posterior distribution is 
unimodal anyway, having the beta density pS(1—p)"-5/B(S+1, n—S+1). 
Thus the region minimizing (10) may fail to be an interval even when the 
posterior distribution (with the same » as prior) is unimodal. 


8. EXAMPLE. VARIANCE OF A NORMAL DISTRIBUTION 


The problem of estimating a variance rt under normal models often reduces 
to that of estimating 7 on the basis of a statistic X such that X/r has a chi- 
square distribution with (say) n degrees of freedom. The chi-square density is 


Kay) = yi e-¥/2"T Gn), = y>0. 
We will be especially concerned with confidence intervals 
x 
b 


where a and b are determined by the conditions 


b 
f k,(xz)dx = 1 — a, (18) 


Knyr(a) - Kn+r(b). (19) 


For r>2—n, it follows from the properties of k,+, (it is continuous, strictly 
unimodal, and vanishes at 0 and ~) that (18) and (19) determine a and b 
uniquely. Let J, be the confidence interval determined by (17)—(19), r>2—n. 

Condition (18) makes the confidence level exactly 1—a for all r. In the fol- 
lowing discussion, “an interval of the form (17)” means one satisfying (18) as 
well. 

Tate and Klett [8] show that (i) I, is the interval of the form (17) with 
minimum length. They also mention that (ii) for samples from a normal dis- 
tribution with unknown mean and variance, the likelihood-ratio criterion leads 
to I, (iti) Iz is the only unbiased procedure of the form (17), (iv) J: is the inter- 
val of the form (17) with minimum b/a, and (v) among all unbiased regions J; 
minimizes the probability of covering false values uniformly for all false values 
and all true values. 
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Statements (i) and (iv) may be generalized as follows. Let us call a confidence 
procedure for 7 invariant if it satisfies 


rx is in the confidence region when X =z if and only if 7 is in the confi- 
dence region when X =1. (20) 


A confidence procedure R(X) for r is invariant if and only if, for some R’ not 
depending on X, 
R(X) = {r;7/X € R’}. (21) 


When PR’ is the interval (1/6, 1/a), (21) reduces to (17). 

Corresponding to the confidence region R(X) for 7 is a confidence region for 
A\=r"*, ux¥0, namely R,(X) = {A:7rE R(X) } R,(X) is invariant for \ (in the 
natural sense) if and only if R(X) is invariant for 7; then R,(X) has size 


f dy\ = uf r—ldr = uxe f z“—dz. (22a) 
AER, (X) r€R(X) 2€R’ 


For u=0, let \=log r. The size of the region Ro(X) in \-space corresponding to 


R(X) is 
f dy = tT dr = f z— dz. (22b) 
NER, (X) reR(X) 2€R’ 


It is proved below (Section 8.2) that among invariant procedures with con- 
fidence level 1—a, I2u42 minimizes (22a or b) for every X. Thus the correspond- 
ing confidence region for \=r* if u¥0 and A\=log r if u=0, has minimum size 
for every X (in the ordinary sense of size), among invariant procedures. 

In particular Jo.;2 minimizes (22a or b) among intervals of the form (17), 
which is statement (i) above when u=1 and statement (iv) when u=0. For 
u=4, we find the likelihood ratio procedure J; minimizes the length (or size, in 
the ordinary sense) of the confidence region for ¢ among invariant procedures 
with level 1—a. 

Note that, among invariant regions for 7, I2u,2 has minimum size for every 
X and the specific measure of size Jrexyt*'dr. (Compare the middle term of 
(22a or b).) This illustrates the possibility, mentioned at the end of Section 3, 
that among invariant procedures, the minimizing procedure may depend on 
the measure of size used. 

The following discussion answers to a large extent some questions raised by 
Tate and Klett [8] about the length of confidence intervals. 

A computation like that leading to (15) shows that, for any measure of size, 
the level a region having minimum expected size when in fact r= 1; is the inter- 
val 

min {7, X/k"} <7 < max {r1, X/k’}, (23) 


where k’ and k”’ are the lower and upper a-points of k,. This interval is not of 
the form (17) and depends on 7. 
Since (23) depends on 1;, let us consider the average expected size 


f Blueray} Wear. 
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The density of X under + is (1/r)k,(2/r), so by (12), a procedure minimizing 
the average expected size for every measure u of size and the particular weight 
function W(r) is 


reject "| ¥ fol) wo ‘eae 


i 
accept 7 1 k (-) <c(r). 
T Ni T 


It is proved below (Section 8.2) that, in particular, J2., is a minimizing 
procedure for W(r) =7r*~!. 

If one has a sample from a normal distribution with mean £ and variance r, 
both unknown, then all procedures may, without loss of generality, be based 
on the sufficient statistic (X, X), where X is the sample mean and X is a multi- 
ple of the sample variance such that X/7 has a chi-square distribution. If one 
also asks that a confidence procedure for 7 remain unchanged upon the addi- 
tion of a constant to all observations, one is led to procedures depending only 
on X, and the foregoing discussion applies. Nevertheless, the interval of short- 
est length based on (X, X) might not depend only on X, and Tate and Klett 
[8] ask whether in fact it does. The region with minimum expected size under 
the particular values £:, 7; does not depend only on X, since the most powerful 
test of the null hypothesis that 7 is the true value against the alternative &, 7: 
does not depend only on X for r>1 [5, p. 95]. The same kind of argument 
shows that if ¢ is any normal density, the region with minimum 





(24) 


f Een {u(RCX, X))}o@de 


does not depend only on X. However, in what might be considered the limit as 
the variance of ¢ approaches ~, the region with minimum /EF;.,,,{ u(R(X,X)) } dé 
is (23), which does depend only on X. This follows from the fact that the usual 
upper-tailed (if 7:>7) or lower-tailed (if 7;<7) test based on X is the most 
powerful test of the null hypothesis that r is the true value against the alterna- 
tive that has a uniform “distribution” on (— ©, ©) and 7; is the true value 
of r. Similarly, (24) is the region with minimum SE, {u(R(X,X)) } W(r)dédr, 
and in particular I,_». minimizes [E;,,{u(R(X,X)) }r*—dédr. 

Clearly the situation is similar in many more complicated normal models. 

8.1. To prove that J2,,2 minimizes (22a or b) among invariant confidence 
procedures with confidence level 1—a, note that when 7 is the true value, X/r 
has density k,,(y) and +/X therefore has density z~*k,(1/z). Accordingly, when 
R(X) is given by (21), 


P,{r € R(X)} = P,{r/X € R’} 


= f 2~*k,(1/z)dz. 
2 


(25) 


We want to minimize (22a or b) subject to the condition that the value of (25) 
be 1~a. According to the Neyman-Pearson Lemma, we may choose R’ to be 
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the region where 
zk, (1/z)/z*-"! <. (26) 
or, since y“*'k,(y) is a multiple of ky42u+2(y), 
kn+outa(1/z) < ce’, (27) 
where the constants c and c’ are chosen to make (25) 1—a. By the properties 
of kysous2, R’ must be an interval, say (1/b, 1/a). At the end-points equality 
must hold in (27), giving (19). Therefore J,,,2 is the minimizing procedure. 


8.2. To show that (24) reduces to Io, if W(r) =7*~', note that for this W(r) 
the numerator in (24) is a multiple of X“~!. Hence (24) becomes 


reject r a X <e,(r), 
if {— kn{ — (28) 

accept r T T >e1(r). 
Now ¢;(r) is to be chosen so that the probability under + of rejecting r is just 
a. Since the distribution under r of X/r is the same for every 7, the required 


¢:(r) is the same for every r. Since also y'~*k,(y) is a constant multiple of 
kense-2u(y), (28) is of the form 


reject a XxX f <c, 
{ if k, +2—2u 7 (29) 
accept r T l>e. 


By the properties of k,42-2., (29) is of the form 


xX 
acceptr if a<—<b, reject otherwise. (30) 


r 


(What we do at the end-points does not affect the probability.) Equation (30) 
is equivalent to (17). Equation (19) follows from the fact that equality must 
hold in (29) at the end-points of (30), and (18) is just the condition determining 
c in (29). Thus J_2. is the region (24) when W(r) =r“"!. 

The result of 8.1 follows from the one just proved, since, for any region of the 
form (21), 


» X\ dz 
J Belaceco jortaa -fff du (eka) oa 
re R(2) 6 6 
-f eoide f r~kaoddy f edu, (31) 
2€R’ 


9. RELATION TO BAYES PROCEDURES 


where z=7r/z and y=2/8. 


The confidence intervals of the previous section can also be interpreted as 
posterior probability intervals for appropriate prior distributions of 7. Specifi- 
cally, suppose r is distributed a priori with density proportional to 7°-', that 
is, r° (or log r, if v=0) is uniformly distributed a priori. Then the posterior 
density of r“ given X is (by Bayes’ Theorem and straightforward calculation) 
proportional to ky42u—2.42(X/r). In particular, taking u= —1, it follows that the 
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posterior distribution of X/7 given X is chi-square with n—2v degrees of free- 
dom. This means that a confidence region at level 1—a computed as though 
there were n—2v degrees of freedom is a region of posterior probability 1—a. 
If v=0, that is, if log r is uniformly distributed a priori, ordinary confidence 
regions at level 1—a have posterior probability 1—a. These facts are not new, 
and are included only to set the stage for statement of the relation between the 
procedures of Section 8 and Bayes procedures. 

Since the posterior density of r“ given X is proportional to k,.42u-2042(X/r), 
the smallest region in r“-space of posterior probability 1—a will be an interval 
with end-points corresponding to values of X/r at which k,42u-2942(X/r) has 
the same value. 

Thus we find that, if r° (or log 7, if v=0) is uniformly distributed a priori, 
then the smallest region of posterior probability 1—a for r“ (or log r if u=0) is 
Tou42 computed as though there were n —2v degrees of freedom. Thus the effect 
of these prior distributions is to chahge the degrees of freedom, while the effect 
of looking for small regions in r“-space is the same for posterior probability 
regions as for invariant confidence regions. When average expected size is 
minimized, as at (24), the effect of the weight function W(r) on the confidence 
region is different from the effect on the posterior probability region of the 
same weight function used either as prior distribution or as measure of size. 

Lindley has pointed out [6, Introduction] that the unbiased procedure I, 
gives the shortest region of posterior probability 1—a for log r if log 7 is uni- 
formly distributed a priori. (If instead r” is uniform a priori, the degrees of free- 
dom need only be reduced by 2v.) Lindley also points out that integrating by 
parts shows I, is the same as J. on two fewer degrees of freedom, so that tables 
of I, can be used for Io, which has minimum size for o~? (not o?, as stated [6, 
Section 1, last paragraph ]). 


10. MEASURE OF SIZE DEPENDING ON THE TRUE VALUE 


In some confidence problems, in particular, one-sided ones, the undesirability 
of covering a false value depends on the true value. Sections 1-4 are easily modi- 
fied to allow this. 

Specifically, suppose @ indexes the possible distributions and we allow yu to 
depend on 6. This excludes A—C, p. 550, but under D (E is no more interest- 
ing), we would have, instead of (1), 


u(R) = f W(r, 0)dr, (32) 
TER 
where dr=dr, - - - dr and W is non-negative. Similarly, (2) and (3) become 


E{uo(R(X))} = f P{R(X) D1} W(z, dr 
(33) 


ys f P{R(X) D 7} Wr, Badr. 
r#7 (0) 


(10) becomes 





AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


f E4{ ue(R(X))}dv(6) = f P.{ R(X) D 1} W(r, O)drdv(6) 
(34) 
J f Ps{X E A(s)}W(r, 0)dv(0)dr, 
and, as in (9), the integral is unchanged if one integrates only over false values, 
that is, over the region r¥7r(@). 


(34) is minimized by choosing, for each 7, the most powerful acceptance re- 
gion A(r) against the alternative P® given by 


P®O{XE A} = f P»{X € A} W(r, 0)dv(@). (35) 


(Compare (11).) If r(@)=@ and X has density or frequency function fo(x), a 
minimizing procedure is: 


f fone, odo 
fa) 





reject r 
\ (36) 


accept r 


Bi 
<c(r), 


as at (12). 

This is just like the situation of Section 4, except that P may depend on r and 
on W. In contrast to the earlier situation, the average expected size (34) for a 
particular » is not minimized simultaneously for all measures of size (32), but 
only for all W(r, 6) = W,(r)W2(r, 6) with a particular W: (and arbitrary W;). 


11. ONE-SIDED PROBLEMS 


Suppose r is one-dimensional and the problem is one-sided. Suppose, for 
definiteness, it does not matter if values of r below the true value are included 
in the confidence region. This suggests a measure of size of the form 


w(R) = f au(. (37) 
t>r(6),rE 


If du(r) =dr, and R is the region of all r below the value U, say, then (37) be- 

comes, for U>+r(0), ue(R) = U—7(6), the amount by which U overestimates the 

true value, and for U<7r(6), ue(I) =0. If du(r) = Wi(r)dr, (37) is equivalent to 

(32) with W(r, 0) = W,(r)W2(r, 0) where W,2(r, 0) =1 if r>7(6) and 0 otherwise. 
Under (37), (2) and (3) become 


E{uo(R(X))} = f P{R(X) D r}du(r). (38) 


r>r (0) 


(10) becomes 


f Bducecnjan = ff PARE) dD ranean 
r>r(0) 


_ f f PAX E AG) av(duCr. 
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If, for each 7, there is a uniformly most powerful test of the null hypothesis 
that 7 is the true value against alternatives py with r(0) <7, then the correspond- 
ing confidence procedure uniformly minimizes the probability of covering false 
values larger than the true value and uniformly minimizes Eo{ ws(R(X ))} for 
all 6 and all measures (37) of the amount of overestimation. Failing this, un- 
biasedness or invariance may be invoked. Alternatively one may minimize (39) 
simultaneously for all measures (37) by including + in the confidence region if 
and only if it would be accepted by the most powerful test of the null hypothesis 
that 7 is the true value against the alternative P™ given by 


P©{X EA} -f P{X € A}dv(@). (40) 
7(@)>r 


In particular, if r(@)=@ and X has density or frequency function fe(x), such a 
procedure is: 


reject ‘t f as a 


accept r f(x) <c(r). 


12. REMARKS 





(41) 


An argument sometimes made against expected length as a measure of the 
desirability of a confidence interval procedure is that, as Lehmann says [5, p. 
182], “Short intervals are desirable when they cover the true parameter value 
but not necessarily otherwise.” Jf a short confidence interval is taken to in- 
dicate accurate information about the parameter, then it may be preferable 
that the interval be long when it is far from the true parameter value. Con- 
sidering instead the probability of covering false values does not avoid this 
difficulty, however. In the first place, the two approaches are related, as shown 
in this paper. More fundamentally, small chance of covering false values is 
also desirable when the true value is covered but not necessarily otherwise. 
A natural way to avoid the difficulty would be to consider both expected 
length and the probability of covering false values conditional on the true 
value being covered. The basic relation between the two approaches still 
holds, that is: 


E{u(R(X)) | R(X) D ro} = f P{R(X) D> r| R(X) D ro}du(r) 


(42) 
- f Pixe A(r) | X © A(ro)}du(r) 


and, again, the omission of r» from the range of integration does not change the 
value of the integrals. 

I have not pursued the idea of conditioning further. The immediate reason 
is that its possible relevance only recently occurred to me, as one result of some 
questions raised in correspondence with the editor to whom I wish to express 
my gratitude. Furthermore, there is in my opinion a more important considera- 
tion than either expected length or the probability of covering false values, 
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namely, to what extent the confidence level has a meaning when a confidence 
statement is regarded as a statement about the unknown parameter 7 for the X 
observed. My real objection to the interval (15) is not that it is a poor con- 
fidence procedure according to some criterion involving the probability of 
covering false values, but that one would not ordinarily have the same con- 
fidence in the statement that r is in the interval (15) when | X/ is small and 
the interval is short as when | X| is large and the interval is long; in particular 
for X>£,, as X increases it seems to become increasingly likely that the true r 
is in the interval. This is true despite the fact that, before X is observed, the 
probability of covering the true value is exactly 1—a no matter what the true 
value (including 7 =0 if a trivial change is made). Just because a statement about 
X and + has probability 1—a for each r and hence has confidence level 1—a, 
it does not automatically follow that there is any useful sense in which con- 
fidence 1—a is attributable to the statement for each X. Though some further 
discussion of this important problem is possible within the frequency theory 
of probability, along the line of [2] and [9], for instance, I doubt that the 
frequency theory alone can really solve it. 

Though the weighting function used to select a confidence procedure in 
Section 4 and in the examples might be regarded as a prior distribution, the 
procedure obtained is generally not a Bayes procedure corresponding to this 
prior. This has already been pointed out in Sections 7 and 9 in connection with 
the second and third examples. In the first example (the normal mean, Section 
5), the weight function put all weight at 0; taking this as a prior distribution, 
the posterior distribution is also concentrated at 0, which does not correspond 
to (15). The situation is reminiscent of a method of testing hypotheses some- 
times suggested, namely, to use as the test statistic the posterior odds against 
the null hypothesis (computed under some prior), judging it not as odds but 
by the usual criteria of testing theory. 

The position in orthodox confidence interval theory of choosing a weight 
function in Section 4 seems to me like that of choosing a statistic in testing 
hypotheses or choosing a design for an experiment. That is, we may use our 
previous information and hunches as we see fit in choosing an experimental 
design, a test statistic, or a confidence procedure, provided we make the choice 
before (or as if before) taking the observations, and provided our procedure 
has the stated confidence or significance level. What are we to say, then, about 
an experimenter who thinks in advance his mean will be near 0 and accordingly 
decides to use a procedure near (15) instead of the usual confidence procedure 
for a normal mean? 
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A COMPARISON OF MAJOR UNITED STATES RELIGIOUS GROUPS* 


BERNARD LAZERWITZ 
University of Illinois 


Religious, demographic, and economic data gathered on three Sur- 
vey Research Center studies have been analyzed by major religious 
groups. The analysis indicates only slight differences in sex composition, 
marital status, and age structure among Protestants, Roman Catho- 
lics, and Jews except for a middle-aged concentration among Episco- 
palians and Presbyterians. People who report they are without a re- 
ligion are heavily male and have a larger percentage single than does 
the nation as a whole. The number of children in Jewish, Episcopalian 
and Presbyterian families is similar, but smaller than the number in the 
other religious groups. On education, occupation, and income, the 
religious groups can be separated into three descending ranks: (1) a 
top-rank, having large percentages of college graduates and white col- 
lar workers, and enjoying high incomes, composed of Episcopalians, 
Jews, and Presbyterians; (2) a middle rank, containing smaller per- 
centages of college graduates and white collar workers, and earning less 
income, consisting of Methodists, Lutherans, Roman Catholics, and 
the “no religion” group; (3) the bottom ranked Baptists with few col- 
lege graduates or white collar workers and low family incomes. 


1. INTRODUCTION 


7 task of collecting information on United States religious groups now 


rests upon research endeavors conducted outside governmental agencies. 

Among recent decisions which emphasize federal reluctance to work in this 
field are the refusal of the United States Census Bureau to release any more 
information beyond a first bulletin from its March 1957, Current Population 
Survey [16], which included questions on religious preference, and the de- 
cision not to include questions on religious preference in the 1960 Census. 

Fortunately, this information gap can be filled with date gathered on national 
probability sample surveys conducted by private survey research organiza- 
tions. While these data are based on smaller sample sizes than the rather large 
Census Bureau surveys and subject to sizable sampling errors for our smaller 
religious groups, accurate conclusions can be drawn from such findings. 

Professor Donald Bogue [3] has recently published data on religious groups 
obtained in 1955 by the National Opinion Research Center. This present article 
presents religious data gathered on three national surveys conducted by the 
Survey Research Center of The University of Michigan. 

Of the three surveys treated here, two [5, 9], conducted in the spring of 1957, 
asked each adult respondent’ for his religious preference and his denominational 
preference if the respondent was a Protestant. The third survey [13] was con- 





* This article was written while the author was a Research Associate of the Survey Research Center, University 
of Michigan. The author is indebted to the Center for use of its data and facilities and for financial support of the 
project. The opportunities for analysis of these religious data were pointed out by Dr. Angus Campbell, Director 
of the Center. 

1 The three surveys accepted interviews with respondents under 21 years of age in certain cases. However, 
the data in this article only apply to respondents 21 years of age or older. (Very few respondents were under 21 
years of age.) 


568 





MAJOR UNITED STATES RELIGIOUS GROUPS 569 


ducted in November, 1958, and asked only for religious preference.? This latter 
survey is included to increase the number of Roman Catholic, Jewish, and 
“no religion” respondents.* 


2. FINDINGS 


Table 1 presents the sex and marital status composition of the major United 
States religious groups. Note the relatively heavy female concentrations, in 
contrast to the national percentage, found for Negro Baptists,‘ and Episco- 
palians. The opposite extreme is found in the “no religion” group which is 
heavily male. Again, Negro Baptists and the “no religion” category rank rather 
low (in contrast to the national figure) on per cent married. The Negro Baptists 


TABLE 1. SEX COMPOSITION AND MARITAL STATUS FOR ADULTS 
OF MAJOR UNITED STATES RELIGIOUS GROUPS, DECEMBER, 1957 


(in percentages) 
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Nation 5 54 
Protestants 
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Whites 
Negroes 
Methodists 
Lutherans 
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No Religion 
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rank high on separated, divorced, and widowed; adults with no religion rank 
high on per cents single and divorced. 

Age composition is given in Table 2. The Episcopalians are rather low in 
the 21 to 29 age categories and high in the 40 to 54 age categories in contrast to 
the national (or Protestant) figures. Similarly, the Presbyterians are high in 
the 55 years old and over brackets. 

Number of children presently in a respondent’s home and respondent’s 
family life cycle stage are reported in Tables 3 and 4. Presbyterians, Episcopal- 
ians, and Jews have the smallest percentage of families with three or more chil- 





2 The lack of a question on Protestant d inational preference in the November, 1958 survey confines de- 
nominational data to the first two studies. The reader should bear this in mind when contrasting the number of 
interviews reported for all Protestants with that reported for specific Protestant denominations. Furthermore, 
only the Pr tant included in this article's tables were specifically coded on the two earlier surveys. 
Information on additional denominations can only be obtained by an expensive recoding process. 

+ Comparisons of the sets of percentages obtained on the three surveys indicate the passage of 18 months 
produced only random changes. With these surveys combined into one, the data may be thought of as referring to 
& point in time midway between the spring of 1957, and November, 1958, namely, December, 1957. 

4 The Baptists are separated by race because Negroes form 24 per cent of this denomination. Negroes comprise 
only 10 per cent of the Methodist denomination so that controlling for race did not change white or over-all Metho- 
dist percentages in any meaningful degree. 
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dren and relatively large percentages of their families with two children. The 
Methodists and the over-all Protestant percentages are between the figures 
reported for Presbyterians, Episcopalians, and Jews and the larger family sizes 
recorded for the “no religion,” Lutheran, white and Negro Baptists, and Ro- 
man Catholic groups. 

Note that Episcopalians, Presbyterians, and the adults with no religion 
groups have fairly high percentages in the no children category. For the “no 
religion” group, this concentration undoubtedly arises out of its dispropor- 
tionate numbers of males and single people. For the other two religious groups 
a more complicated explanation seems in order which Table 4 helps provide. 

Episcopalians have the largest percentages with no children in the home 
and respondent from 35 to 64 years of age; Presbyterians have the largest 


TABLE 2. AGE COMPOSITION OF ADULTS OF MAJOR UNITED 
STATES RELIGIOUS GROUPS, DECEMBER, 1957 


(in percentages) 
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Religious Groups y ‘ y 30-34 | 35-39 | 40-44 { 50-54 | 55-59 | 60-64 ‘ Total 





Nation : 2 12 13 9 
Protestants . r : ll 12 10 
Baptists K 12 12 9 
Whites q ¢ 2 q ll 12 9 
Negroes f 14 10 11 
Methodists y ll ll 10 
Lutherans K é 9 12 
Presbyterians f q 9 13 9 
Episcopalians 2 13 17 y 11 
Roman Catholics q 12 15 8 
Jews 16 16 
No Religion ¢ 12 12 11 
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percentage with no children in the home and respondent 65 years of age or 
older (and the second largest percentage in the previous life cycle category). 
These two denominations also have disproportionate concentrations in the over 
45 years of age brackets as shown by Table 2. 

While the data presented here cannot explain these denominational char- 
acteristics, they do suggest this possible hypothesis: that the Episcopalians 
and, to a lesser degree, the Presbyterians gain a fair number of new, middle- 
aged members from other Protestant denominations. Such recruits should 
have advanced their social and economic lot in life and would be seeking added 
status through membership in the Episcopalian or Presbyterian churches.® 





5 In forming Table 4 study codes only permitted placing families into the following eight life cycle categories: 
(1) respondent single and under 35 years of age; (2) respondent married and under 35 years of age but with no 
children in the home; (3) respondent married and with at least one child under 5 years of age in the home; (4) 
respondent married and all children in the home 5 years of age or older; (5) respondent not now married but does 
have children in his home; (6), (7) and (8) are categories in which there are no children now in the respondent's 
home, and the respondent is of the indicated ages. 

* Dr. Niles Carpenter, consultant for the General Division of Research and Field Study of the Episcopal 
Church, in correspondence with the author has expressed the opinion that the concentration of older adults in the 
Episcopal church (and, perhaps, the Presbyterian church) is a result of several factors besides status seeking. These 
additional factors are (1) the attraction of the Episcopalian church for dissident Roman Catholics; (2) the more 
liberal Episcopalian attitude toward “worldly behavior”; (3) the Episcopalian tradition of having a highly educated 
clergy which serves to attract and retain well educated laity. ‘ 
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That these two churches do have large proportions of high status people is 
shown by the remaining tables. 

Large differences appear in Table 5 which gives amount of schooling. A 
status array can be formed with Episcopalians on top having 53 per cent with 
one or more years of college, followed by Presbyterians with 36 per cent, and 
Jews with 33 per cent. In the middle come people with no religion, Methodists, 
Lutherans, and Roman Catholics. On the bottom are the Baptists, both whites 
and Negroes. Finally, note the large concentration exhibited by the “no 
religion” group in the 0-8 grades category. Unlike the others, this group shows 
concentrations at both the high and low ends of the education scale. 

This interesting ranking of the religious groups recurs on occupation of 
family head shown in Table 6. The Episcopalians have the largest professional 


TABLE 3. NUMBER OF CHILDREN IN THE HOMES OF MAJOR 
UNITED STATES RELIGIOUS GROUPS, DECEMBER, 1957 


(in percentages) 
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Nation 5827 16 22 
Protestants 4185 16 22 
Baptists 939 q 25 
Whites 713 f ¢ 25 
Negroes 226 29 
Methodists 730 j ¢ 21 
Lutherans 328 24 
Presbyterians 272 } 16 
Episcopalians 119 15 
Roman Catholics 1270 f ¢ 27 
Jews 188 16 
No Religion 125 23 





























percentage, followed by the Jews, “no religion,” and then the Presbyterians. 
The Jews have the largest owners, managers, and officials percentage, followed 
by the Episcopalians, and Presbyterians. The Episcopalians, Presbyterians, 
and Jews have very few farmers. 

In the middle are found the Catholic, Methodist, and Lutheran groups. On 
the bottom of the occupational pile come the white and Negro Baptists. 

Forty-two per cent of the Catholics are found in the skilled and semi-skilled 
categories with the rest distributed throughout the other groupings. Catholics, 
too, have few farmers, and their occupational distribution resembles the white 
Baptists except for the latter’s large farm percentage. 

As would be expected, the Negro Baptists have the largest unskilled per- 
centage among all religious groups, together with the lowest percentages in 
the three white collar categories. 

The “no religion” group continues to exhibit an atypical pattern. While out- 
ranking the Presbyterians in the professional category, it drops below the level 
of the white Baptists on owners, managers, and officials, clerical and sales 
occupations. 





572 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


This hierarchical pattern is further borne out by Table 7, on total family 
income. On top, once more, are the Episcopalians with 46 per cent of their 
families earning $7500 or more, followed by the Jews with 42 per cent, and 
the Presbyterians with 26 per cent of their families earning $7500 or more. 

In the middle, one finds “no religion,” Catholies, Lutherans, and Methodists. 
On the bottom, again, are white and Negro Baptists. 

Once more, adults with no religion exhibit a concentration at both ends of 
the economic continuum. Twenty-one per cent of them earn under $2000 as 


TABLE 4. FAMILY LIFE CYCLE FOR MAJOR UNITED STATES 
RELIGIOUS GROUPS, DECEMBER, 1957 


(in percentages) 
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Nation 5827 
Protestants 4185 
Baptists 939 
Whites 713 
Negroes 226 
Methodists 730 
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contrasted to the national figure of 15 per cent, and 22 per cent earn $7500 or 
more in comparison to the national figure of 19 per cent. 

Table 8, presents data on church attendance. The national pattern of 45 
per cent attending church regularly (once a week or mure); 21 per cent attend- 
ing often (once, twice, or three times a month); 26 per cent attending seldom 
(a few times a year or less); and 8 per cent never attending is a Protestant 
pattern. Roman Catholics, as has been well documented, are regular attenders, 
and Jews seldom attend.’ 

In other words, four patterns of church attendance can be found in the 
United States: 

(1) The Catholic pattern typified by about 75 per cent attending church 
weekly. 

2) The Protestant pattern which has one-third to 40 per cent attending 





1 For studies of more restricted populations that have investigated the question of church attendance see 
[1, 2, 4, 6, 12, 14, 15}. 
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weekly, 20 to 30 per cent attending often and seldom, and 5 to 10 per cent re- 
porting no attendance. 

(3) The Jewish pattern which is predominately an annual “High Holiday” 
attendance. 

(4) Adults with no religion who rarely (if ever) attend church services. 


3. SUMMARY 
The sex, marital status, and age distribution data indicate few major dif- 


TABLE 5. AMOUNT OF EDUCATION FOR ADULTS OF MAJOR 
UNITED STATES RELIGIOUS GROUPS, DECEMBER, 1957 


(in percentages) 








Some | 4 Yrs. Y 4 Yrs. or 
Religious Groups N 0-8 | High | High oA T8-! More of 
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Grades 





Nation 5827 10 
Protestants 4185 10 
Baptists 939 7 
Whites 713 7 
Negroes 226 3 
Methodists 730 10 
Lutherans 328 9 
Presbyterians 272 20 
Episcopalians 119 25 
Roman Catholics 1270 9 
Jews 188 17 16 
No Religion 125 10 12 


























ferences among the religious groups. The small percentage of young Episcopal- 
ians and the large percentages of middle-aged Episcopalians and Presbyterians 
compared to the national percentages could be a result of sampling error. How- 
ever, the repetitions of similar concentrations in the tables on number of 
children and family life cycle give emphasis to the hypothesis that other 
Protestants join the Episcopal and, to a lesser extent, the Presbyterian churches 
after a rise in social and economic status. 

The relatively low sex ratio among Negro Baptists would well be a result 
of under representation of Negro males in the three surveys.* The pattern of 
more broken marriages among Negro Baptists is in accord with recent Census 
reports.® 

The differences among the religious groups on numbers of children accord 
with the results presented in the recent Freedman, Whelpton, Campbell book 
{7]. Table 3 indicates that Jews have the highest percentage of two children 
families; the Catholics have a high percentage of their families with three or 
more children, and the white Baptists, Lutherans, and Methodists fall be- 

* Survey Research Center national samples usually obtain slightly fewer adult males (1 or 2 per cent) than 
would be expected on the basis of Census reports. The Census Bureau and survey organizations find that the greater 
mobility of Negro men produces a further reduction in sample yields for this population group. 


* For instance, United States Bureau of the Census, “Marital Status and Family Status: March, 1958,” 
Current Population Reports, Series P-20, No. 87. 
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TABLE 6. OCCUPATION OF FAMILY HEADS FOR MAJOR UNITED 
STATES RELIGIOUS GROUPS, DECEMBER, 1957 


(in percentages) 
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tween the Catholics and Jews. Negro Baptists exceed Catholics on percentage 


of families with three or more children. 

The high status, highly urbanized Episcopalians and Presbyterians display 
the small family pattern which characterizes the Jews. This, too, conforms 
with the conclusions reached by Goldberg [8 ]. 

The education, occupation, and income data sort the religious groups intoa 
social hierarchy. Setting aside the “no religion” group for the moment, one can 
form the following ranks: 

Top: Episcopalians 
Jews 
Presbyterians 


TABLE 7. TOTAL FAMILY INCOME IN 1956 FOR MAJOR UNITED 
STATES RELIGIOUS GROUPS 


(in percentages) 
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TABLE 8. CHURCH ATTENDANCE DURING 1957 FOR ADULTS OF 
MAJOR UNITED STATES RELIGIOUS GROUPS 


(in pasnentages) 


| a 








Religious Groups 





| Regularly Often 


Seldom Never 





Nation 5827 45 21 | 26 

Protestants 85 39 | 23 30 
Baptists é $1 31 | 22 
Methodists ee ie Ee 
Lutherans 32 39 29 
Presbyterians 7 41 20 | 
Episcopalians ¢ | 33 | 23 

Roman Catholics 27 72 13 

Jews § | 13 2 

No Religion 25 | 2 








® Nonresponse is classified as never. 


Middle: Methodists 
Lutherans 
Roman Catholics 

Bottom: White Baptists 
Negro Baptists 


The differences among the Methodists, Lutherans, and Catholics are small. 
The ranks in the above scale together with the data recorded in Tables 4 to 7 
(except for sampling variations and some coding differences) are in substantial 
agreement with those reported by Bogue [3]. 

The bottom position of Negro and white Baptists comes as no surprise. 
The unfortunate position of the Negro in our society is well documented. White 
Baptists are a heavily Southern group—69 per cent living in the South in 1957 
according to Bogue’s data and the South is our underprivileged region. The 
very high position of United States Jewry may come as a surprise to some. 
When one remembers that heavy Catholic and Jewish immigration began 
about the same time (the 1840’s for Irish Catholics and German Jews; the 
1880’s for Eastern European Jews and other Catholic groups), the extremely 
rapid rise of the Jewish group becomes quite extraordinary.'® 

Finally, a word about the “no religion” group is in order. Apparently, this 
group is capable of being subdivided into at least two more homogeneous sub- 
groups. The first subgroup seems to be composed of well-educated, fairly high 
income, professional people who account for the “top-level” concentrations 
previously noted. The other subgroup seems to be composed of poorly-educated, 
low income, blue-collar workers and farmers. This latter subgroup would ac- 





* The three surveys also indicate that 31 per cent of the Jews, 16 per cent of the Catholics, and 3 per cent 
of the Protestants are foreign-born. It should be borne in mind that numerical size can serve as a limiting factor on 
& group’s social position. Relatively large religious groups, such as the Catholics or Lutherans, should have a more 
difficult time finding enough “better-off” social “slots” to enable as great a percentage of their groups to rank as 
high on education, occupation, and income as the smaller Episcopalian and Jewish groups. The numerically smaller 
groups, such as Episcopalians and Jews, can find a comparatively larger number of high status social “slots” rela- 
tive to their numbers. On the other hand, one would expect each religious group to form as great a percentage, by 
chance alone, of the various social status positions as it does of the nation’s population. Clearly, this is not the case. 
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count for the large percentage of adults reporting no religion who belong to 
families having three or more children. 

It is hoped that these survey data will aid in removing some of the questions 
presently asked about United States religious groups. 


4, APPENDIX 


A. Sample Design: One national sample was selected for both 1957 surveys; 
it was then randomly divided at the final sampling stage for the two surveys. 
Another sample, selected from the same primary and secondary sampling 
units used for the 1957 studies, was used for the November, 1958 survey. The 
two sample designs used the 66 primary sampling units employed on standard 
Survey Research Center studies; the selection procedures made use of the city 
directory and segmentation techniques developed by the Sampling Section of 


TABLE 9. APPROXIMATE SAMPLING ERROR* OF PERCENTAGES 


(in percentages) 
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* The figures in this table represent two standard errors. Hence, for most items the chances are 95 in 100 that 
the value being estimated lies within a range equal to the reported percentages, plus or minus the sampling error. 

Two estimates cf the sampling error are presented for each cell. The lower values are based on the standard 
error formula for simple random samples. The higher values are based on the computations of individual sampling 
errors carried out on the study data, and allow for the departure from simple random sampling in the survey design 
such as stratification and clustering. 


the Survey Research Center. Since it is not possible to describe completely the 
designs of these samples in this brief paper, the interested reader is referred 
to [9, 10] for such a detailed presentation. 


B. Schedule Questions: All interviewing was done by the regular field staff 
of the Survey Research Center. The religion question asked on the two 1957 
surveys was: “What is your religious preference?” On the 1958 survey the ques- 
tion was: “Is your church preference Protestant, Catholic or Jewish?” Boxes 
being also provided for “other” or “none” responses to this question although 
they were not part of the question proper. In addition, interviewers for the 
two 1957 surveys were instructed to ask this question if a respondent stated 
he (or she) was a Protestant: “What religious denomination is that?” 

The Census Bureau survey [16] used a similar question—“What is your 
religion?” The Census Bureau, Bogue, and the Survey Research Center all 
report that respondents very readily accepted the question pertaining to 
their religion. It is doubtful that these slight differences in question wording 
among the various surveys differentially affected respondents’ replies in any 
significant degree. 
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TABLE 10. APPROXIMATE SAMPLING ERROR* OF DIFFERENCES 
BETWEEN PERCENTAGES 


(in percentages) 
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® The values shown are the differences required for significance (two standard errors) in comparisons of per- 
centages derived from two different subgroups of the study. Two values—low and high—are given for each cell. 
See note a to Table 9. 


C. Sampling Errors: Tables 9 and 10 give the sampling errors for individual 
percentages and for the difference between two percentages for varying numbers 
of interviews. The low level estimate found in the table cells gives the 95 per 
cent confidence limits based upon the standard simple random sample for- 
mula. The high level estimate takes into consideration the additional amount 
of variance derived from the use of a clustered sample; it, too, gives 95 per cent 
confidence limits. The procedures and statistical formulas used to obtain 
these sampling errors are presented in Kish and Hess [11]. 

To illustrate the use of the tables, let us find the sampling error for the 20 
per cent of the Methodists who have some high school training. Since the total 
number of Methodist interviews is 730, we enter the column of Table 9 headed 
700 and the row headed 20 or 80. This tells us that chances are 95 out of 100 
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that the 20 per cent is subject to a sampling error of plus or minus 4.2 per cent’ 
(using the high level estimate). Again, to find the sampling error of the dif- 
ference between the 63 per cent of the Negro Baptists who have 0-8 grades of 
education, based on 226 interviews, and the 35 per cent of the Lutherans who 
have 0-8 grades of education, based on 328 interviews, we enter that section 
of Table 10 headed “For Percentages from 35% to 65%.” Finding the cell at 
the intersection of 300 (row) and 200 (column), we note that the chances are 
95 out of 100 that the difference between 63 per cent and 35 per cent, namely 
28 per cent, is subject to a sampling error of plus or minus 12.7 per cent (using 
the high level estimate). Since 28 per cent exceeds 12.7 per cent, there is a 
significant difference between the Negro Baptists and Lutherans on per cent 
having 0-8 grades of education. 

Frequently, the difference between two percentages of the data presented in 
Tables 1 to 8 exceeds the proper high level estimate of sampling error. Hence, 
the two percentages can be considered significantly different at a 95 per cent 
confidence level. Occasionally, some of the conclusions presented are not 
based upon percentages whose differences exceed their proper high level es- 
timates. In all such cases, the percentages follow clearly discernable patterns, 
and it is probably correct to assert that such differences among religious groups 
are not functions of sampling variations. 


D. Comparison with Census Bureau Survey: An interesting verification of the 
samples discussed in this article is found in Table 11 which compares the 


TABLE 11. COMPARISON OF CENSUS BUREAU AND SURVEY 
RESEARCH CENTER STUDIES 
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Census 
Bureau* March April November 
1957 1957 1958 
Protestants 6. 70.5 71. 

Baptists 9. 21.1 21. 

Methodists , 16.3 17. 

Lutherans : 8.1 6.£ 

Presbyterians 5.6 6.5 5 

Episcopalians 2. 

Other Protestants 9.8 15.6 
Roman Catholics 5. 22 .6 
Jews 3. | 3.6 
Other Religions , 0.§ 
No Religion ‘ &. 
Religion Not Reported ‘ 0. 
Nation ‘ 100. 
Sample Size : 2,458 


Religious Groups 











* Census Bureau percentages pertain to the civi!ian population 14 years old or over as of March, 1957. 

> Survey Research Center percentages pertain to the civ..ian population 21 years old or over as of the indicated 
dates. 

® The Census Bureau has not published a separate Episcopalian percentage. 

4 Protestant denomination not asked on this survey. 
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religious percentage breakdowns obtained by the Census Bureau with the three 
surveys treated here. The Survey Research Center surveys show a larger per- 
centage of Protestants and a smaller percentage of Catholics than does the 
Census survey. These differences probably arise out of the different age groups 
being reported upon (the Census Bureau reports on persons 14 years old or 
over; the Survey Research Center reports on persons 21 years old or over), 
sampling errors, and from slightly different procedural definitions employed 
by the two organizations. 
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FURTHER COMMENTS ON THE “FINAL REPORT OF THE 
ADVISORY COMMITTEE ON WEATHER CONTROL”* 


Jerzy NEYMAN AND Evizasets L. Scorr 
University of California, Berkeley 


The Final Report contains only two assertative conclusions: A(1) that 
commercial cloud seeding in mountainous areas produced 10-15 per 
cent increase in precipitation, and A(3) that no seeding project meant 
to increase precipitation was found in which the indicated effect of 
cloud seeding was negative. With reference to A(1) evidence is intro- 
duced showing that the apparent 10-15 per cent increase in precipita- 
tion is likely to be due to a combination of factors other than cloud 
seeding. Conclusion A(3) was found inconsistent with some of the 
Advisory Committee’s own findings and with some other published 
results. Section 8 indicates that a Bayes decision rule whether to seed 
or not, published in Volume II of the Final Report as minimizing the 
risks, may well maximize these risks. Section 9 indicates a misrepresen- 
tation and a significant omission in Volume II of the Final Report. 


1. INTRODUCTION 


N HIs review article [6] on the “Final Report of the Advisory Committee on 

Weather Control” (Final Report, for short) [10], Professor Brownlee ques- 
tions the validity of certain findings of the Advisory Committee. We are in 
perfect agreement with Brownlee, but we think that he did not cover all the 
grounds for legitimate skepticism in the Final Report. Our purpose is 

(i) To reinforce Brownlee’s criticism of the Advisory Committee’s main con- 
clusion A(1) by documenting some of Brownlee’s statements and by bringing 
out additional arguments against the validity of their conclusion. 

(ii) To question another conclusion of the Committee, namely conclusion 
A(3), which was left without comment by Brownlee. 

(iii) To emphasize the ineptness, briefly mentioned by Brownlee, of a par- 
ticular study published in the Final Report. 

(iv) To indicate a misrepresentation and an omission in articles published in 
the Final Report. 

If and when methods of artificial weather modification are established, this 
will mean untold benefits to humanity. The pioneer experiments conducted at 
the General Electric Company by Langmuir, Schaefer, Vonnegut and others 
suggest that some kind of cloud seeding may be effective in increasing rainfall. 
Also, more recent randomized experiments by the Chicago meteorological 
group [4] indicate that seeding with water droplets of semitropical cumulous 
clouds helps to initiate rain. On the other hand, to our knowledge, thus far 
there is no real evidence that cloud seeding with silver iodide released from 
ground generators has any effect on precipitation. 

In our opinion, the solution of the problem of weather modification may come 
about only through meticulously planned and executed randomized experi- 
ments involving not only measurements of precipitation but also simultaneous 
observations on a number of accompanying phenomena (condensation nuclei 





* This paper was prepared with the partial support of the National Scienve Foundation, Grant G-8211. 
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in the atmosphere, humidity, temperature gradient, etc.). Such experiments 
pose problems of considerable complexity. The ail-important raingages, a 
large number of them scattered over a substantial area, must be effectively pro- 
tected against all possible interference. It may be necessary to build roads so 
that these raingages can be serviced irrespective of weather. Also, it is neces- 
sary to arrange that the heterogeneous interests of the population in the given 
area do not interfere with the plan of the experiments. For example, a suddenly 
initiated commercial cloud seeding program in an adjoining county may be 
such an interference. (This is exactly what happened in a particular experiment 
[17] with which we are familiar.) Law suits for alleged damages to crops may 
be another. It is obvious that these and similar practical difficulties, not to 
speak of instrumentation and of the performance of the experiments, involve 
considerable costs. In our opinion, satisfactory weather modification experi- 
ments can be conducted only through a powerful national organization, such 
as the National Center for Atmospheric Research, once proposed by a Com- 
mittee of the National Academy. Calculations referring to nonrandomized 
commercial cloud seeding projects lead, at best, to inconclusive conjectures 
and, at worst, to promotional literature. The statistical part of the Final Report 
of the Advisory Committee is based entirely on an analysis of commercial 
cloud seeding projects. 


2. CONCLUSIONS OF THE ADVISORY COMMITTEE 


The two conclusions A(1) and A(3) are the only assertative conclusions of 
the Advisory Committee itself, as contrasted with the conclusions of its staff. 
They are as follows: 

“A(1) The statistical procedures employed indicated that the seeding (by 
commercial operators) of winter-type storm clouds in mountainous areas in 
Western United States produced an average increase in precipitation of 10 to 
15 per cent from seeded storms, with heavy odds that this increase was not the 
result of natural variations in the amount of rainfall.” 

“A(3) No evidence was found in the evaluation of any Project which was 
intended to increase precipitation that cloud seeding had produced a detectable 
negative effect on precipitation.” 

The Advisory Committee’s conclusion A(1) is based on statistical procedures 
described by H. C. 8. Thom in his papers [20] and [21] published in the second 
volume of the Final Report. They consist in the application of the so-called 
historical regression method. Conclusion A(1) was criticized by Brownlee on 
several counts, particularly because it is not based on a randomized experi- 
ment. Our own criticism of the conclusion A(1) goes slightly farther. 


3. BROWNLEE’S CRITICISM OF CONCLUSION A(1) 


Brownlee’s questioning of the reliability of the Advisory Committee’s con- 
clusion A(1), that cloud seeding (rather than some other factor) actually is 
responsible for a 10 to 15 per cent increase in precipitation, because this conclu- 
sion is not based on a randomized experiment is, of course, indisputable. Brown- 
lee also indicates a possible mechanism whereby the apparent increase in pre- 
cipitation ascribed to seeding may be due to another factor, namely to success 
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in forecasting. This mechanism is based on the premise that the commercial 
operators seed only a fraction of all storms passing over the target, those that 
they select themselves as “seedable.” Then the target and the control precipita- 
tion from seeded storms are compared with the “historical regression line” of 
the precipitation in the target on that in the control established for all storms 
over a period of time during the preseeding era. Naturally, if the cloud seeder 
has any ability as a forecaster, the storms selected by him for seeding may 
easily be those depositing relatively more precipitation in the target than in 
the surrounding areas and, as the number of observations is increased, the 
statistical test will detect a significant excess of precipitation from seeded 
storms, even if the actual effect of seeding is nonexistent. 

Another possibility, discussed in the next section, is that seeding has an 
effect but only on certain types of storms, and that seeders successfully select 
these (or at least some of them). Also possible is a combination of these two 
mechanisms. 

Returning to the plausible mechanism that the cloud seeder may be able to 
select for seeding those storms that will deposit relatively more precipitation in 
the target than in the surrounding area, this possibility alone is enough to in- 
validate the historical regression method on which conclusion A(1) of the 
Advisory Committee is based. In fact, it is likely to apply to some cloud seeding 
operators though, perhaps, not to all. This mechanism, capable of vitiating the 
conclusions regarding the effectiveness of cloud seeding, has been mentioned 
before, at least since 1951 [15], and in his paper [20] Thom offers a defense. 
His main argument is that, apart from a few cases when there is a danger of 
floods, the commercial cloud seeding operators seed all the storms passing over 
the target. Thom’s denial of selection of storms to be seeded has been pointed 
out to us by one of the referees of this paper. Here, then, a certain amount of 
documentation is needed in order to establish the facts. 

In 1951-55 we were engaged in the study of the methodology of the evalua- 
tion of cloud seeding operations undertaken for the Division of Water Re- 
sources, Department of Public Works of the State of California. The first report 
[15] dated December 1951 was published in 1954. Further results, partly super- 
seding the first report, appeared [12] in 1955. Since the methodology of evalua- 
tion could not be studied in the abstract, three particular cloud seeding projects 
were studied in detail, the Carrizo Plain project, the Santa Barbara project, 
and the Southern Sierra project all in the seasons 1950-52. The Santa Bar- 
bara project was evaluated by the Statistical Laboratory jointly with the Divi- 
sion of Water Resources [9]. The work on the Southern Sierra project is due to 
Arnold Court [10]. Significant differences between seeded target precipitation 
and the expectations derived from historical regression lines were found in the 
first two projects but not in the third. 

In these three projects, the frequency of seeded storms is as given in Table 1. 

The Santa Barbara project is included in the sample of cloud seeding opera- 
tors evaluated by Thom. 

The storms in the above tabulation were defined partly by Edward M. 
Vernon, then Chief Regional Forecaster of the U. 8. Weather Bureau in San 
Francisco, and partly by Arnold Court, Meteorological Consultant of the Sta- 
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TABLE 1. FREQUENCY OF SEEDED STORMS 








Total number Number of Per cent of 


Project of storms storms seeded storms seeded 








Carrizo Plain ¢ 64 65 
Santa Barbara ¢ 53 54 
Southern Sierra 24 46 


Total 141 57 




















tistical Laboratory. However, the definition of a storm period is always subjec- 
tive and debatable. Table 2 gives unambiguous data on the frequency of seed- 
ing, showing the distribution of days covered by contracts for seeding in all 
the projects in California in the 1951-52 precipitation year which were in- 
tended to increase precipitation and in which silver iodide smoke was used. 
The data for the table were obtained from precipitation records of the U. 8. 
Weather Bureau and from the official records of the Division of Water Re- 
sources [11] as printed in 1954. 

It appears then, as reflected in the official logs of cloud seeding operations, 
that in the seeding season of 1951-52 which is covered by the evaluation of 
Thom, it was the practice of commercial cloud seeding operators in California 
to seed only about 60 per cent of days during which there is a substantial rain 
in the target. Recent publications [8] and [16] iridicate that, at least with some 
operators, this practice persists up to now. The paper [8] is especially relevant 
because it gives the rule applied by the particular operator to select seedable 
storms, namely those in which the —5°C level is lower than 12,000 feet. 

A possible reason for letting about 40 per cent of the storms go without seed- 
ing may be the custom of commercial cloud seeders to report to their customers 
on their achievements storm by storm. As a result, if a particular storm does 
not offer good prospects for the effect of seeding, it is better to abstain from 
seeding. Silver iodide is believed effective only in supercooled clouds, in tem- 
peratures below —4°C. If these temperatures obtain only at very high levels, 
with little chance of the smoke released from the ground reaching the super- 
cooled clouds, there is no reason to seed. 

The problem of seedability of storms has a substantial literature. In particu- 


TABLE 2. CLASSIFICATION OF UNITS OF OBSERVATION ACCORDING 
TO AVERAGE RAIN IN THE TARGET AND SEEDING 








All commercial orerations in California 1951-52 





Average precipitation . 
in inches Total no. of days 


ile iataliendl No. days seeded Per cent seeded 





Exactly zero 640 52 
0.00-0 .03 93 21 
over 0.03 303 167 
over 0.10 220 130 
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lar a sixty-two page article [13] under the title “Nuclei, Cumulus and Seeda- 
bility Studies” is published in Volume II of the Final Report. 

The general conclusion seems to be that, at least in California, the commer- 
cial cloud seeding operators do make efforts to select for seeding only those 
storms that offer a good prospect for success and it seems unreasonable not to 
do so. Whether they are successful in their selection is another matter. As indi- 
cated by Brownlee [6] there is evidence [17] that in some cases the selection 
is effective. 


4. TYPES OF STORMS AND AN ALTERNATIVE MECHANISM WHEREBY THE 
APPARENT EXCESS IN PRECIPITATION ASCRIBED TO SEEDING MAY 
BE DUE TO ANOTHER FACTOR 


While success in forecasting is a possible alternative to the real effects of 
seeding in creating significant deviations from the historical regression line, in 
the present section we produce certain data suggesting that these deviations, 
occasionally positive and occasionally negative, may actually be due to another 
mechanism, also mentioned by Brownlee. This alternative mechanism is based 
on the year-to-year variation in the frequency of certain types of storms. There 
is little doubt that this mechanism operates in certain areas, particularly where 
there are mountains. Whether or not the numerical effect of the mechanism of 
varying frequency of types of storms is strong enough to account for significant 
deviations of seeded precipitation from the historical regression lines is not 
definitely established. To do so, a substantial numerical calculations program 
would be necessary. This program would involve computing historical target- 
control regression lines for varying periods, say 7’, in the preseeding era and in 
comparing them with the target precipitation observed in nonoverlapping some 
what shorter periods ¢ of time, also before the era of cloud seeding. Then the fre- 
quency of significant deviations, particularly if it is supported by meteorologi- 
cal analysis of storms, may establish that the mechanism of types of storms 
does have effects, strong enough to produce the appearance of effectiveness of 
cloud seeding even if, in actual fact, there is none. On the other hand, of course, 
if in a number of instances no significant differences between periods 7' and ¢ 
are found, this would not constitute evidence that no such differences prevail 
in other cases. 

The phenomenon of types of storms is interesting not only as a possible 
factor invalidating the method of historical regressions. In addition, there is 
the possibility that the success of cloud seeding depends upon the type of storm. 
It is for these two reasons combined that in the present section we produce cir- 
cumstantial evidence that the mechanism of varying frequency in the types of 
storms probably was operating in several instances when significant apparent 
effects of seeding were found. We must begin with a brief description of types 
of storms and of the manner in which they are identified. 

During our work for the Division of Water Resources we noticed that for a 
given combination of target and control areas, particularly in mountainous 
regions, it is possible to identify objectively several different types of storms, 
each characterized by a different target-control regression line: for some types 
of storms these regression lines are higher than for others. 
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In order to get an intuitive feeling of this phenomenon it is sufficient to vis- 
ualize a target and a control area separated by a mountain ridge running, ap- 
proximately, on a straight line. The target-control regression line for storms 
with the moisture flow perpendicular te the mountain barrier, passing first over 
the control area and then over the target, will be lower than the regression line 
corresponding to storms with the moisture flow in the opposite direction. 

These two types of storms, with the moisture flow perpendicular to the 
mountain barrier, will be the extreme types, one most favorable and the other 
least favorable to the target. Between these two extremes there will be a num- 
ber of intermediate types, with moisture flow at an angle to the chains of 
mountains. Frequently, the configuration of the target and the control area is 
much more complicated than visualized above. In particular, in the Santa 
Barbara project a part of the target is literally crisscrossed with mountains in 
all directions so that, with a fixed central area, a given storm may strongly 
favor one part of the target and be very unfavorable to another part. 

The situation is complicated by the fact that the direction of air flow is not 
the only factor determining the type of storm. The first attempt at a definition 
of types of storms is due to Arnold Court. Subsequently, at our request, two 
systems of types of storms were constructed for the Carrizo Plain project, one 
by Edward M. Vernon [22] and the other by Dr. Morris Neiburger, Professor of 
Meteorology, University of California, Los Angeles [14]. Modifications of these 
systems for the two other projects were prepared by one of the authors and by 
Arnold Court. In each case, the classification of a given storm into one of the 
types depends upon such factors as differences in pressure at specified points, 
upper wind direction, etc., but not precipitation. 

The work of Court, Vernon, and Neiburger seems to have established the 
existence of types of storms which favor to a different degree a given target 
compared to a given control area. In particular, the validity of Vernon’s defini- 
tion of types relating to the Carrizo Plain area has been checked by the staff 
of the Advisory Committee. According to Thom [20, p. 34], Vernon’s types 
were verified on data which were not available to us, presumably war-time 
data. 

While the existence of different types of storms, each with a different target- 
control regression, is not disputed, there is considerable uncertainty as to the 
best system of criteria for the classification of storms. In particular, Thom 
claims to have established criteria which are more effective than Vernon’s, how- 
ever, no details are published. 

For the Carrizo Plain, Vernon distinguishes two broad categories of storms 
designated by Rorman numerals I and II. He expected the first to be generally 
favorable to the target and the second generally unfavorable. In order to be in- 
cluded in category II, a storm must be characterized by atmospheric pressure 
(at the start of the storm) at Santa Maria exceeding those at San Francisco, at 
Eureka and at an observation point in the Pacific Ocean with coordinates 40°N 
and 130°W. All other storms are classified in I. 

Both categories I and II are subdivided into three subcategories distinguished 
by letters a, b, and c. There the distinction is based on atmospheric pressure 
and upper wind direction at Santa Maria. In both categories the letter “a” 
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indicates the extreme. Thus the types Ia and IIa were expected by Mr. Vernon 
to be the extremely favorable and the extremely unfavorable to the target, re- 
spectively. 

It will be noticed that the above classification of storms and, in fact, any 
other classification that may be attempted, is affected by the scarcity of meteoro- 
logical data. The four stations at which the pressure needed for Vernon’s classi- 
fication is taken are hundreds of miles away from each other and the station 
nearest to the Carrizo Plain, namely Santa Maria, is some one hundred miles 
distant from the target-control area. In these circumstances, an attempt at an 
objective classification of storms is comparable to an attempt, say, to classify 
Europeans into the many national groups using several measurements of their 
bodies. The inevitable result would be that Europeans classified as “French” 
will include a very substantial admixture of other nations and vice versa. Thus, 
one should not be surprised to find that any of Vernon’s types contains a sub- 
stantial number of storms which should have been classified differently. Also, 
one should not be surprised to find that some of the criteria defined by Vernon 
fail to separate the storms into different types as regards the target-control 
regression lines. 

In order to check whether the expectations of Mr. Vernon were justified, 
historical regression for each of the six types of storms were obtained. T. A. 
Jeeves’ contribution to our joint work consisted in adjusting the Scheffé multi- 
ple decision procedures in order to pick out those subtypes of storms which 
showed any indication of difference in the target-control regressions. As a result 


of this test, the original six basic types defined by Vernon were combined [12] 
into three “operational types” as follows 


A=I,+I, favorable to the target 
B=I,+II.+I1], intermediate 
C=II, unfavorable to the target. 


It is relevant for what follows that the operational type C may be expected 
to be more homogeneous than either type A or B. 

Following the establishment of different types of storms the next step in our 
study was to check whether the frequency of different types is constant or 
varies from year to year. As expected, the variation is considerable. In fact, it 
is extremely probable that the proverbial year-to-year variation in the general 
weather pattern is reducible to the variation in the prevalence of types of storms. 

The existence and the various characteristics of the different types of storms 
are relevant to the problem of weather modification from a number of points of 
view and it is regrettable that, after checking that identifiable types of storms 
do exist, the staff of the Advisory Committee proceeded with evaluations with- 
out any reference to types of storms. 

First, there is the likelihood that the effects of seeding, if any, depend upon 
the type of storm. For some of them, those with a greater turbulence and a 
favorable air flow, the silver iodide smoke may be lifted from the ground into 
the clouds where it may do some good. With some other weather characteristics 
the effect of silver iodide smoke may well be limited to a contribution to local 
smog. 
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The second point to consider is the possibility that, at least some of the com- 
mercial cloud seeders are aware of the existence of different types of storms and 
use this knowledge in selecting the storms to be seeded. This is the point of view 
which occurred to us during our earlier study and here is a relevant quotation 
from Vernon [22, p. 181], a professional meteorologist. 


“It is readily apparent that if a cloud seeding operator . . . had been clever or fortu- 
nate enough to have seeded only Class I, storms, he could point to the regression for 
seeded storms (which would have been coincident with that for I,) and by comparing 
it with the regression for all storms lay a rather convincing but, as we can see, invalid 
claim to have produced a large increase in rainfall. On the other hand, if he had been 
so unfortunate or unwise as to have seeded only Class II, storms, any moderate in- 
crease in precipitation which he might have brought about could easily have been 
obscured by the expected difference between the regression for that type and the one 
for all types combined.” 


The answer is, of course, that if nonrandomized commercial cloud seeding 
operations are evaluated at all, the evaluation based on historical regression 
lines must be done by types. The unavoidable difficulty is that one can never be 
certain that the definition of the various types of storms is adequate. The life 
insurance companies have similar problems with the differences in mortality 
rates from one group of the population to another and, at considerable expense, 
continuously dig into their records and employ physicians to achieve a correct 
classification of each prospective policy holder. 

The third aspect of the problem of types of storm is the plausible dependence 
of the results of the historical regression evaluation both on the selected control 
area and on the epoch to which the historical regression line refers. It is com- 
mon knowledge that periods of drought may last for several consecutive years. 
This makes it plausible that the several, say 10, years of the preseeding era 
used for the establishment of a historical regression line may be dominated by 
storms of a particular type, say those unfavorable to the target of seeding 
operations. If during the period of cloud seeding these unfavorable storms are 
replaced by those more favorable to the target, then in a complete absence of 
seeding effects, the historical regression method may show a significant posi- 
tive deviation of seeded precipitation. On the other hand, if the historical re- 
gression line is based not on 10 but, say, on 30 years of observations, the ap- 
parent positive effect of seeding may well disappear or even turn into an 
apparent negative effect. Similar changes in the apparent effects of seeding may 
be expected from a change in the control area. 

Our first piece of circumstantial evidence that the described mechanism of 
varying types of storms is capable of producing the difficulties just described is 
the paper [5] by Brier and Enger describing their two attempts to evaluate the 
cloud seeding operations in Central Arizona. Using a 10-year period of observa- 
tion preceding cloud seeding and the data from 72 stations, the authors found 
a regression equation indicating a significant positive effect of cloud seeding. 
Then, using a 30-year period of observations at 37 stations, the authors found 
another regression equation with which the precipitation during the seeded 
years appears distinctly within normal limits. No explanation is offered and 
the paper ends with the query as to which of the two results should be given 
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credence. The only plausible explanation that occurs to us is the mechanism 
of types of storms. 

Our second item of circumstantial evidence in favor of this mechanism is 
represented by the results of our own evaluations. Full details are published 
in reference {12]. Here we reproduce a few excerpts and supplement them by 
certain recalculations whose need arose during the preparation of the present 
paper. 

Briefly, the circumstantial evidence mentioned consists in the fact that in all 
cases, without exception, when the seeded storms showed a deviation from the 
corresponding historical regression which is significant at five per cent or 
sharper, the storms during the same period which were left unseeded showed a 
deviation from the same regression line having the same sign as the sign of the 
seeded storms. Also, if one does not limit oneself to cases of significance but 
takes all the 24 separate evaluations performed, the deviations from the regres- 
sion lines of the seeded and the unseeded storms in the same season are posi- 
tively correlated. This correlation is strong enough for us to suspect the exist- 
ence of a factor which, in a given season, affects both the storms that the com- 
mercial cloud seeding operator chooses to seed and also those that he decides 
to leave without seeding. The factor that seems most plausible to us is the 
prevalence during the given seasons of an unidentified type of storm. The 
operator must have been unaware of this type; he seeded some storms belong- 
ing to it and left some others without seeding. 

As already mentioned, out of the three cloud seeding projects evaluated by 
us, only two showed significant deviations of seeded storms from the cor- 
responding historical regressions, the Carrizo Plain project and the Santa 
Barbara project, both of 1950-52. Complete evaluation results, slightly sup- 
plemented, are given in Tables 3 and 4, respectively. Of the three panels in 
Table 3 the two extremes, giving the deviation from historical regressions, 
averaged per storm (in the original there are totals of these deviations), first 
for storms in the two seasons that were seeded and then for all storms whether 
they were seeded or not. The significance probabilities were recorded only for 
those cases where they were 0.20 or less. 

Unfortunately, at the time of our earlier study our attention was directed 
toward certain other aspects of the problem and it did not occur to us to make 
a complete separate evaluation of those storms in the seeded period which were 
left without seeding and, therefore, no significance probabilities for unseeded 
storms are available. The evaluation of all storms, whether seeded or not, was 
performed at the request of the Division of Water Resources which was in- 
terested in the over-all presumed effect of seeding. The middle panel of Table 3 
was calculated in connection with the present paper using the data in the two 
extreme panels. 

While Table 3 contains the results of 18 separate evaluations, only six of 
these are strictly independent, namely those performed for seeded storms of 
the three types for each of the two separate seasons. Of these six independent 
evaluations, three have significance probabilities less than one-tenth, not very 
likely to be due to chance alone. Our own attitude is that the seeded storms in 
those two seasons had target precipitation distributed differently than in the 
preseeding period used for the construction of the historical regression lines. 
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Although the homogeneity of the three types of storms is subject to doubt, 
it was natural to supplement the valuation by separate seasons with one for 
two seasons combined. In one case, for type C, the significance of the average 
effect was increased. In the other two cases there are indications of year-type 
interactions. Because of the fact that of the three “working” types A, B, and 
C only the last coincides with a single original type of Vernon, while A and B 
are conglomerations, the consistency of the two-year results of type C and the 
lack of such consistency for the two other types is not surprising. 

If one adopts the attitude of the Advisory Committee that the deviations from 
historical regression lines of precipitation from seeded storms reflects the ef- 
fects of seeding, then the first panel in Table 3 indicates that in some cases this 


TABLE 3. SUMMARY OF EVALUATION OF THE 
CARRIZO PLAIN PROJECT 1950-52 
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effect is NEGATIVE. In other words, the indications are that the seeding di- 
minished the precipitation. Against this hypothesis is the consistency of the 
results of evaluation of seeded storms with those corresponding to all storms 
in the two seasons, whether seeded or not seeded. This kind of consistency is 
also apparent in Table 4 summarizing the evaluation of the Santa Barbara 
cloud seeding project. Here again separate evaluations were made for seeded 
storms and for all storms in the season, whether seeded or not, so that the mid- 
dle panel does not have significance probabilities listed. 

As mentioned earlier, the Santa Barbara target has a most complicated to- 
pography and, in order to obtain a more or less convincing system of types of 
storms, the whole target has to be divided into five subtargets each with a sepa- 
rate system of types. Actually, the classifications for targets 1, 4, and 5 coincide. 
However, there was the expectation that the same type of storms may have dif- 
ferent properties in the three subtargets. 

A glance at Table 4 indicates that the Santa Barbara project is marked with 
very strong positive apparent effects of seeding, with significance probabilities 
reaching one-tenth of one per cent. However, here again the assumption of 
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effectiveness of seeding is weakened by the remarkable consistency of devia- 
tions from the historical regression lines of the seeded and the nonseeded storms, 
For each of the five subtargets there is just one type of storm for which the 
seeded storms show a significant increase in precipitation and, invariably, the 
nonseeded storms of the same type also show an increase. It is interesting that 
in three of the five cases, the order of magnitude of increase in precipitation of 
nonseeded storms is distinctly of the same order of magnitude as that for the 


TABLE 4, SUMMARY OF EVALUATION OF THE 
SANTA BARBARA PROJECT, 1950-52 
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seeded storms. It is most unfortunate that no significance probabilities for the 
nonseeded storms have been computed. 

In Table 5 the evaluations performed are classified according to the sign of 
deviations from historical regression lines of seeded and not seeded storms of 
the same type, irrespective of whether any of the deviations were significant 
or not. There were 24 such separate evaluations for which a elaim of independ- 
ence can be made: 3 types X2 seasons=6 for Carrizo Plain and 18 for Santa 
Barbara. 

In the case of complete independence of both seeded and not seeded storms 
and also of the particular observations, a set of 24 such observations would 
show the association exhibited in Table 5 or stronger with a probability 0,033. 
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The correlation between deviations of seeded and not seeded storms of the 
same type was also tested by computing the ordinary correlation coefficient be- 
tween data in the first two panels of Tables 4 and 3 (excluding the last three 
lines). The results are r=0.377 for Carrizo Plain and 0.473 for Santa Barbara. 

Unfortunately, the interpretation of the correlation exhibited is not quite 
clear, As already suggested, the consistent deviations from the historical regres- 
sion lines for seeded and not seeded storms of the same type may be due to the 
prevalence in the two years studied of unidentified subtypes of storms with 
considerable representations among the seeded and among the nonseeded 
storms. A second possibility, not excluding the first, is that the correlation in 
Santa Barbara is due to the lack of independence of observations on the five 
subtargets. These five subtargets are relatively small adjoining areas separated 
by chains of mountains. The evaluations for all these subtargets are based on 
the same storms. It is true that for each evaluation the storms are either 
grouped differently (see Table 4) or, at least, are expected to react differently 


TABLE 5. DEVIATIONS FROM HISTORICAL REGRESSION LINES OF 
TARGET PRECIPITSTION FROM SEEDED AND NON- 
SEEDED STORMS OF THE SAME TYPE 
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to local peculiarities of the terrain. Still, the proximity of the five targets is 
suggestive and there is the possibility that instead of 18 evaluations for Santa 
Barbara we should have only four, which would change the situation consider- 
ably. 

Another possible source of correlation between the deviations of seeded and 
nonseeded storms has been pointed out to us by one of the referees. This is that 
these deviations are measured from the same historical regression line, fitted to 
data of several earlier years and, undoubtedly, subject to random variation. 
Thus, the error in the position of the historical regression line for a given combi- 
nation of target and control affects, in a similar manner, the deviations of 
seeded and of nonseeded storms. There is no doubt that the correlation sug- 
gested by the referees exists. However, the order of magnitude of this correla- 
tion, say p, is substantially smaller than the two values 0.377 and 0.473 ob- 
tained for our evaluations. As is vaguely indicated in Tables 1 and 2 and as is 
confirmed by the data on commercial cloud seeding operations, the storms se- 
lected for seeding are stronger than those left unseeded. Taking this into ac- 
count, specifically, assuming that the mean control precipitation for nonseeded 
storms is less than the long range average, while the analogous mean for seeded 
storms is larger than the long range mean, simple calculations show that p may 
well be negative. Also its upper bound is easy to calculate. For an evaluation 
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by single years, as was done for Carrizo Plain, this upper bound is 1/2», where » 
is the number of years of observations on which the historical regression line is 
based. If the evaluation combines two years of observations, with a double 
number of storms, as was done for the Santa Barbara project, then the same 
upper bound is 1/»+/2. In our case yv=5 and the resulting upper bounds are 0.1 
for Carrizo Plain and 0.142 for Santa Barbara. We should mention that the 
upper bounds indicated are “liberal” and not likely ever to be reached. 

Assertion A(1) of the Advisory Committee describes the increase in precipi- 
tation attributed to seeding as a percentage of the estimate of what would have 
fallen without seeding. Table 6 gives such percentages for the six evaluations 
described above in which the seeded storms showed significant differences from 
expectations. 


TABLE 6. DEVIATIONS FROM THE HISTORICAL REGRESSIONS AS 
PERCENTAGES OF EXPECTATIONS. SIX CASES WHERE SEEDED 
STORMS SHOWED SIGNIFICANT DIFFERENCES 








Seeded storms Nonseeded storms 





Per cent increase | Significance Prob. Per cent increase 





Carrizo —39 .05 —27 
SBI * +464 01 +76 
SB2 495 001 +28 
SB3 +70 001 +22 
SB4 ~ +49 .03 +51 
SB5 f +79 .01 +65 














It is seen that the percentage deviations from the regression lines, both the 
negative one at Carrizo Plain and the positive at Santa Barbara, are sub- 
stantially larger than the 10 to 15 per cent mentioned by the Advisory Com- 
mittee. The latter figures, of course, come from averaging the results of sev- 
eral evaluations. It is interesting that the difference in the performance of non- 
seeded and of seeded storms is rather modest. 

While existing data do not provide reai evidence that the cause of the curious 
correlation between the seeded and nonseeded storms is due to the mechanism 
of varying frequencies of unsuspected types of storms, some favoring the tar- 
get and others favoring the control area, the hypotheses that this is so appears 
very plausible. This, then, is another argument for the assertion that the eval- 
uation of nonrandomized cloud seeding operations using the method of his- 
torical regression lines, even if this method is applied “by types,” is not reliable 
and cannot serve as a basis for conclusions regarding the effectiveness of cloud 
seeding. 


5. BUILT-IN BIAS IN THE EVALUATIONS OF THE 
STAFF OF THE ADVISORY COMMITTEE 


It is well known that, if a function f is not linear and if X is a random vari- 
able, then, generally, the expectation of f(X) is not equal to f[E(X) ]. In par- 
ticular, if the function f is downwards concave then the expectation of f(X) 
exceeds the value of f[E(X)]. Formulae for correcting the bias 
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E[f(X)] — f[E(X)] 


are published from time to time, based on a number of different assumptions. 
For example, see reference [18]. In particular, the formulae relevant to the 
evaluation of cloud seeding operations by the regression method, using pre- 
cipitation data subjected to a transformation capable of stabilizing the condi- 
tional variance, were published in reference [12]. In this paper the original data 
in inches were replaced by their square root. In the papers published in Volume 
II of the Final Report a much more complicated transformation was used, al- 
though it is admitted that (p. 12) “the square root transformation probably 
would have been satisfactory, . .. .” Recalculations performed in the Statisti- 
cal Laboratory indicated that the numerical differences between the results of 
the square-root transformation and the transformation employed by the staff 
of the Advisory Committee are negligible. In fact, in both cases the original 
variable is a convex function of the transformed variable, so the true value of 
the expectation of the target precipitation in inches is greater than the value 
obtained by backward transformation of the mean of the transformed variable. 
In consequence, when comparing the amount of rain from a seeded storm with 
its presumed normal value obtained from the historical regression line eval- 
uated on transformed data and then recomputed in inches, the difference ob- 
tained is systematically too large. The amount of bias varies from storm to 
storm but it is computable. 

Using the data for the Santa Barbara project, month of February, pub- 
lished in the Final Report, Volume II, p. 46, we estimated the bias involved 
in this particular evaluation as 0.06 inches, so that the unbiased estimated mean 
increase for this month in this target is 0.15 inches rather than 0.21 inches. In 
this evaluation, the historical period was divided into “even” and “odd” storms. 
Partly as an illustration of the method, the even nonseeded historical storms 
were compared with the odd nonseeded storms and vice versa. The two com- 
parisons are obviously not independent. The estimated average differences in 
precipitation are —0.003 and +0.093 inches, respectively. The mean of these 
differences, namely, +0.045 is of the same order of magnitude as the estimate 
of bias. 

In his evaluations Thom [20] does not appear to have used any correction 
for the bias just described. With reference to each particular project, the nu- 
merical value of the bias is comparable to the sampling error and, therefore, 
may be judged unimportant. However, the conclusions of the Final Report are 
based not so much on evaluations of separate projects as on a comprehensive 
average of several evaluations. In particular, the estimate of 10-15 per cent 
increase ascribed to seeding is based on such an averaging. In these circum- 
stances, it is relevant that the bias estimated for those projects amounts to 
about one-half of the estimated proportional increase. 


6. NONREPRESENTATIVENESS OF THE SAMPLE OF OROGRAPHIC CLOUD SEEDING 
OPERATIONS STUDIED IN THE FINAL REPORT, VOLUME II 


In his “Summary of the Statistical Evaluations...” Gerald D. Berndt 
[3, p. 92] gives an impressive description of the scope of evaluations performed, 
thus: 
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“There was made available to the Advisory Committee for these evaluations 
a collection of data far larger in both quantity and variety than had ever previ- 
ously been assembled and analyzed in any single effort to evaluate cloud seed- 
ing operations. .. . In all, a total of 427 storms seeded for the purpose of in- 
creasing the precipitation have been analyzed for seeding effects. ... The 
results for the West Coast classes are impressive for a number of reasons... . 
The seeded storms, which total 299 for these classes, are for most of these proj- 
ects those of the entire seeding program... . In this respect, the evaluations 
of the orographic and West Coast classes may be said to be exhaustive of the 
extant information for the operations within these classes.” 

Actually, the staff of the Advisory Committee [paper (ii) Volume II of the 
Final Report] reported storm by storm evaluations of only four cloud seeding 
projects on the West Coast, three in California and one in Oregon. (One other 
California project was evaluated using river discharges not storm by storm.) 
The California projects, all conducted within the period 1950-1955, are identi- 
fied as Mokelumne-Stanislaus, Santa Barbara, and Santa Clara (with two sub- 
targets; one with Mt. Hamilton rain station, orographic and the other with 
San Jose target station, semiorographic). However, the official publication of 
the State of California [10], has a list of cloud seeding projects covering five 
pages 18-22 and we find the following comment (p. 23): “Consequently, in 
1950-51 there was a sudden renewal in activity, with 15 organized programs 
scattered through the southern two-thirds of the State. ... During 1951-52 
the same high level of activity was maintained, with 12 programs and some 
adjustment and consolidation.... During the 1953-54 and the 1954-55 
seasons there were 11 and 12 programs, respectively.” Thus, the California 
projects reported by the Advisory Committee staff amounted to about one- 
quarter the total for that state. The principles on which the particular projects 
were selected for evaluation are described thus (Final Report, Volume II, p. 
30): 

“The projects evaluated were chosen in the several classes [orographic, semioro- 
graphic and nonorographic] on the basis of the quantity and quality of the available 
data on which the evaluation must be based. This was done by determining the 
availability of a suitable control together with suitable target and control measure- 
ments of precipitation. Projects with long operating records were favored since our 
objective was to replicate as much as possible. No project was chosen or discarded on 
the basis of anticipation of either positive or negative results. There was no knowledge 


of actual rainfall amounts or tables prior to analysis; preexamination of data sources 
concerned only the existence of adequate record.” 


The first publication of the California study appeared in 1954. H. C. Thom’s 
paper quotes the California study and, in fact, discussed certain of its aspects in 
some detail. Therefore, when selecting the projects for its own evaluation, the 
staff of the Advisory Committee was well aware that the California study 
showed no significant deviations from historical regression lines for Southern 
Sierra, significant negative deviations for the Carrizo Plain and highly signifi- 
cant positive results for the Santa Barbara project. 

The latter project, giving significant positive results, is included in the 
sample of the Advisory Committee, but the other two projects, giving nonsig- 
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nificant and significant negative results, are excluded. The Carrizo Plain proj- 
ect is discussed in Volume II, p. 34 of the Final Report and the discussion shows 
that a considerable amount of work has been done on it. In fact, the Advisory 
Committee’s staff checked our definitions of storm types and, using a set of 
data which is assertedly larger than our own, produced new types of storms 
showing “more contrast” than those obtained in our study. Yet, this particular 
project is not included in the list of the Advisory Committee used to produce 
the estimated average 10-15 per cent increase in precipitation ascribed to 
seeding. Turning to the principles of selection of projects used by the staff of 
the Advisory Committee quoted above, one might expect that the “preexami- 
nation” of the data on which the Carrizo Plain project could be evaluated did 
not reveal “the existence of adequate record.” However, the following state- 
ment (p. 34) indicates that the staff of the Advisory Committee found excellent 
records for the Carrizo Plain project: 
“, .. the historical meteorological data available in this area is as good as it is any- 
where, and it takes only a cursory examination to show that it would be difficult and 


sometimes impossible to approach this quantity of information for typing on our 
other projects.” 


So, the Carrizo Plain project was discarded for other reasons than the lack 
of reliable data. These reasons are briefly indicated thus (p. 34): 
“It was also clear that the Carrizo Plain was an exceptional and perhaps a trivial 


example of typing from our standpoint, for we had already concluded that because 
of inadequate seeding the project was not satisfactory for evaluation.” 


This statement indicates that the staff of the Advisory Committee selected 
projects for evaluation not only on the principle of availability of data but also 
on some other principle, namely that the seeding be done “adequately.” Unfor- 
tunately, no definition of “adequate” seeding is given, and because of the omis- 
sion of the Carrizo Plain project with excellent records but with an apparent 
negative effect of seeding, the representativeness of the sample of projects eval- 
uated by Thom is open to question. 


7. COMMERCIAL CLOUD SEEDING PROJECTS WITH APPARENT 
NEGATIVE EFFECTS OF SEEDING 


Conclusion A(3) of the Final Report, Volume I, p. vii asserts that “No evi- 
dence was found in the evaluation of any project which was intended to increase 
precipitation that cloud seeding hac produced a detectable negative effect on 
precipitation.” However, according to the data on p. 39 of the second volume of 
the Final Report, the South Carolina project showed four negative “standard 
departures” for monthly evaluations, out of six, and one of them, amounting 
to —1.98 is beyond the one-sided 5-per cent significance point. The standard 
departure for the whole South Carolina project, not given in the Final Report, 
is —1.73, also beyond the 5-per cent significance point. 

The authors believe that the inconsistency between the results of the South 
Carolina project and assertion A(3) of the Advisory Committee has not been 
adequately explained. A speaker at the joint meeting of the American Meteor- 
ological Society and the American Association of Civil Engineers held in Den- 
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ver in 1949 suggested that the evaluation of the South Carolina project was 
ignored because the area covered by the evaluation is larger than the irltended 
target of seeding operations. This does not seem to be a sufficient reason for 
ignoring the indicated negative effect of seeding. 

On the contrary, it seems to us that if it is determined that negative effects 
of seeding are observable in areas surrounding the proper target of seeding 
operations, then this constitutes a very significant result worthy of explicit 
mention and further study. 

In addition to the South Carolina project, there is, of course, the Carrizo 
Plain project. As already mentioned, the staff of the Advisory Committee was 
well informed of the California study and apparently was attracted by the no- 
tion of the varying types of storms. This idea of types of storms caused the 
statisticians of the Advisory Committee to perform quite some work on the 
Carrizo Plain data. 

With this amount of work done, intended merely to verify the existence of 
differences in regression lines corresponding to the identifiable types of storms, 
we wonder how the authors of the relevant paper could have withstood the 
temptation to check the main part of our analysis indicating negative effects 
of seeding in Carrizo. It is true that, according to the passage already quoted, 
they have already decided “that the project was unsatisfactory for the evalua- 
tion because of inadequate seeding.” Still, adequate seeding or not, is it not 
interesting to see whether and to what extent the seeded storms deviate from 
the corresponding regression lines? 

Whatever the case may be in this respect, it is our opinion that the evidence 
indicating apparent negative effects should have been thoroughly investigated. 
If mistakes were found, they could have been reported. Failure even to mention 
these negative effects seems to us a serious omission. 


8. CURIOUS SELECTION OF THE “RAINFALL VARIABLE” IN THE STUDY OF 
THE SOUTH DAKOTA RAIN MAKING OPERATIONS 


The study by the staff of the Advisory Committee failed to show any evi- 
dence of positive effects of seeding in the nonorographic projects. However, a 
sincere effort was made to give the farmers some indication as to the risks in- 
volved alternatively in paying for cloud seeding and in abstaining from such 
expense. The theory of this problem was developed by Gerald D. Berndt [2]. 
It arbitrarily assumes an a priori distribution of a particular parameter sup- 
posed to characterize the distribution of seeded rainfall. Also there is defined a 
loss function taking into account both the cost of seeding and the benefits from 
rain. Then a decision rule is determined whether to seed or not to seed. The de- 
cision rule, minimizing the risk, is to seed whenever the sum )-*_,2; of cer- 
tain observable variables exceeds a specified limit and not to seed in other cases. 
These variables x; are the subject of our particular interest. They are called 
“rainfall variables” and are defined as follows (p. 73): 


“ ... we shall define our rainfall variable for each storm in the following way. There 


are five target stations. If during the ith storm the sth (s=1, 2, +--+, 5) station has 
d, days with rainfall whose total amount is ¢, inches, then z;, the average rainy-day 
storm amount per station, is given by 
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y= Yt > a.” (1) 
s=l s=l 

As indicated above, if for a given set of seeded storms, the sum of the vari- 
ables (1) is large, then the indicated decision is to seed, but not otherwise. The 
practical consequences of this choice of observable variables in the presence of 
either positive or negative effects of seeding are not discussed in the Final Re- 
port. We submit the following possibility. 

Suppose, as expected in the various parts of the Final Report, the seeding 
has a threefold positive effect: (a) by triggering rainfall ahead of the time it 
would start without seeding, (b) by extending the duration of rain as compared 
to normal, without seeding, (c) by increasing somewhat the amount of pre- 
cipitation during the period in which normal precipitation would fall, without 
seeding. 

In line with these hypotheses, and assuming for simplicity that the target 
contains just one raingage, we may visualize the following amount of precipi- 
tation from a single storm to be expected alternatively in the absence and in 
the presence of seeding. 


TABLE 7. HYPOTHETICAL POSITIVE EFFECT OF SEEDING 
A THREE-DAY STORM 


| 
Day number 1 2 3 
| 








t=total | d=days 
precip. | with rain 








Precipitation in inches: | 
Without seeding ’ ‘ .00 1.00 
With seeding 1.50 














In other words, in the presence of positive effects of seeding, of exactly the 
kind expected by theorists of weather modification, amounting to 50 per cent 
of the amount which would have fallen without seeding, the rainfall variable 
x assumes a value equal to one-half that corresponding to the unseeded storm. 
Conversely, if, contrary to expectation, the seeding causes a decrease in pre- 
cipitation, so that the lower line in Table 7 corresponds to the unseeded storm 
and the upper to one subject to seeding, this negative effect of seeding would 
correspond to a two-fold increase in the “rainfall variable.” Yet, the decision 
rule deduced indicates: seed when the sum total of values of x is large, other- 
wise do not seed! 

After applying the theory to the data of seeding in South Dakota, Berndt 
arrives at the following result: “Application of the Bayes’ procedure to the 
seeded South Dakota storms indicates, as an interim result, that ... there is 
less risk in choosing to seed in May, June, and August—but not July, than in 
taking the alternative course of action in those months.” The illustration in 
Table 7 indicates the possibility that the farmers in South Dakota will run a 
lesser risk by doing just the opposite, seeding in July but not in May, June, 
and August. 

Apart from the inept choice of the rainfall variable, the paper by Berndt is 
impressive by the use of sophisticated statistical procedures: test of storm to 
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storm independence, Kolmogorov-Smirnov test, power functions, risk func- 
tions, and Bayes’ decision rules. As indicated in the paper by Dr. Berndt, it was 
prepared in consultations with and under supervision of Admiral F. A. Berry 
and Professor Max A. Woodbury, the Chief Scientific Adviser of the Committee 
and the Chairman of the Statistical Advisory Panel, respectively. 


9. A MISREPRESENTATION AND AN OMISSION 


A considerable part of the second volume of the Final Report, pp. 97-207 
and 247-306, is given to reports on various studies either sponsored or directly 
organized by the Advisory Committee. Because of our participation in the co- 
operatively conducted Santa Barbara Randomized Cloud Seeding Experiment 
started in January 1957, we are familiar with the background of two of these 
reports, one by Robin Reynolds [19] and the other by MacCready, et al. [13]. 
The first of these papers contains a misrepresentation, the other a significant 
omission. If we did not mention these two incidents, our discussion of the Final 
Report would be incomplete. 

(a) Robin R. Reynolds is the Chairman of the Board of Directors of the 
Santa Barbara Project. His paper gives a description of the project and 
enumerates the cooperating parties including the North American Weather 
Consultants, Inc. (NAWC, for short), a commercial cloud seeding firm perform- 
ing the seeding for the project, and also including the Statistical Laboratory of 
the University of California, Berkeley, responsible for the randomization of 
the operations and for the statistical evaluation of the results. This article con- 
tains the following statement, p. 252: “Although no definite answer can be ob- 
tained from the evaluation of this one season, it is interesting that all com- 
parisons so far indicate increased precipitation in the target area during periods 
of cloud seeding as compared to the nonseeded periods. A rough estimate of a 
23 per cent increase can be made from the preliminary data.” Although there is 
no clear-cut statement in the article of the authorship of the estimate of 23 per 
cent increase, the reader is led to believe that the Statistical Laboratory was 
at least partly responsible for it. 

It is relevant that the Final Report is dated December 31st, 1957. It is also 
relevant that in December, 1957 the NAWC issued a mimeographed booklet 
[18] reporting on its commercial operations in the Upper San Joaquin Basin. 
The booklet begins with a description of the contemporary status of cloud 
seeding. This description includes the findings of the Advisory Committee on 
Weather Control and also a reference to the Santa Barbara Randomized Cloud 
Seeding Project. The relevant sentences collected on p. 5, are as follows. 

“_.. The statistical design and analysis is being conducted by the Statistical Labo- 
ratory of the University of California at Berkeley. . . . The data for the first year have 


been analyzed. . . . The average increase for the first season was 23 per cent, although 
this figure must be considered tentative. . . .” 


The concluding sentence on p. 5 reads: 


“... The fact that the Advisory Committee findings and the preliminary results of 
the Santa Barbara project agree so closely with our findings for the Upper San Joaquin 
Valley project lends considerable additional confidence to those results.” 


Here, then, we see the same estimate of 23 per cent increase in Santa Bar- 
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bara and a strong suggestion that this estimate was obtained by the Statistical 
Laboratory of the University of California. The last sentence quoted indicates 
that this estimate and the references to the findings of the Advisory Committee 
are used to reinforce the confidence in the claims of the NAWC of success in 
its commercial operations. 

As a matter of fact, the estimate of 23 per cent has no relation to the findings 
of the Statistical Laboratory. A preliminary evaluation, based on incomplete 
data, presented at the meeting of the Board of Directors on September 4th, 
1957, indicated increases of 100 to 200 per cent, much greater than ever claimed 
by the cloud seeding industry, and raising concern about the reliability of the 
raw data. Those preliminary results for 1957 were confirmed by the complete 
evaluation of the three years of operation of the project, recently published 
[16] in the “Interim Report” of the Board of Directors of the project. This 
same paper records (pp. V-55 to V-56) the inconsistency of Reynolds’ article 
with the official evaluation results. 

(b) While the contacts between the Advisory Committee and the main part 
of the Santa Barbara Randomized Experiment were limited to the participation 
of the Committee’s Chief Scientific Adviser, Admiral F. A. Berry, on the 
Project’s Board of Directors, the study of MacCready, et al. [13] was con- 
ducted “for the Advisory Committee.” As stated in the Summary, p. 138, the 
subject of the voluminous report includes the discussion of “the relative value 
of statistical studies and physical evaluations for establishing the validity of 
cloud seeding effects... it is noted that each has its role but that physical 
evaluation can offer particular advantages when a variety of possible effects 
must be studied within a few years.” 

Among the several studies described in the report, one (pp. 191-196) is con- 
cerned with the authors’ work conducted in the mountains in Santa Barbara 
County in parallel with the randomized cloud seeding experiment mentioned 
above. This work consisted in taking time-lapse movies of the radar screen, in 
making studies of seedability and, in particular, in using radar data to detect 
the effects of seeding. The article describes instrumentation, a number of de- 
tails of several storms and reproduces six pictures of the radar screen. Also, 
p. 191, the article states that “ . . . about 110 hours of time-lapse PPI pictures 
were taken during the storms. These were examined for seeding effects without 
knowledge of the silver iodide generator operation and with the information 
from a time-height cross section of the storm, and then were reanalyzed with 
the knowledge of the silver iodide generator operation.” However the article 
omits a particular item reported at the meeting of the Board of Directors of 
the Project held on September 4th, 1957, which seems to us quite relevant. 
This is that, operating without knowledge as to when the silver iodide genera- 
tors were active, MacCready, et al. detected the results of seeding on five dif- 
ferent occasions, although on three of these occasions there was no seeding at all. 
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THE USE OF SAMPLE RANGES IN SETTING EXACT CONFIDENCE 
BOUNDS FOR THE STANDARD DEVIATION OF A 
RECTANGULAR POPULATION 


H. Leon Harter 
Aeronautical Research Laboratory, Wright-Patterson Air Force Base 


A discussion is given of point estimates and interval estimates of the 
population standard deviation ¢, based on the sample range and quasi- 
ranges. In the case of a rectangular population, the efficient point esti- 
mate and the most effective interval estimates are those based on the 
sample range, so it is not necessary to consider estimates based on 
sample quasi-ranges. The coefficients of the sample range w in the 
exact confidence bounds for the population standard deviation ¢ are 
found by taking the reciprocals of percentage points of the (stand- 
ardized) range W =w/c. A six-decimal-place table of percentage points 
for the range corresponding to selected cumulative probabilities and 
sample sizes is presented. A second table shows coefficients of the sample 
range in exact lower confidence bounds. 


1. INTRODUCTION 


INCE soon after the appearance in 1935 of a justification by E. 8. Pearson 
S and Haines [7], it has been common practice to use the ranges of small 
samples to estimate the population standard deviation o. For the normal 
population, the efficiency of estimates based on the sample range decreases 
quite rapidly as the sample size n increases. In 1946 Mosteller [6] advanced 
the suggestion, which was latent in a 1920 paper by Karl Pearson [8], of using 
quasi-ranges of larger samples to estimate ¢. Harter [2] has tabulated the most 
efficient unbiased estimates of o based on one quasi-range for samples of size 
n=2 (1) 100 from normal and rectangular populations and on linear combina- 
tions of two quasi-ranges for samples of size n=4 (1) 100 from the normal 
population. Chu, Leone, and Topp [1] have proposed a procedure, using sample 
quasi-ranges, of setting confidence bounds for the population standard devia- 
tion. Leone, Rutenberg, and Topp [5] have obtained approximate confidence 
bounds for the standard deviation of normal, exponential, and rectangular 
populations by first applying distribution-free methods and then imposing 
the distribution. Though the basic procedure is sound, the use of distribution- 
free methods yields approximate confidence intervals for the standard devia- 
tion of the rectangular population which are much longer than the exact 
confidence intervals which are given in the present paper. 

For the rectangular population, which is the one under consideration in this 
paper, the efficient point estimator of the population standard deviation is 
the one based on the sample range. Similarly, interval estimates based on the 
sample range are more effective than those based on any of the sample quasi- 
ranges. In order to determine the coefficients of the sample range w in the 
exact lower confidence bounds for ¢, it has been necessary to tabulate the 
probability integral of the range, then perform inverse interpolation to deter- 
mine the percentage points of the range, and finally take the reciprocals of 
the percentage points. (See reference [3] for a discussion of the method of com- 
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putation of a double-precision table of the probability integral of the range and 
for an eight-decimal-place version of the table.) The following tables, together 
with a discussion of the methods of computing and checking them, are included 
in this paper: (1) A six-decimal-place table of the percentage points of the 
range corresponding to cumulative probabilities P=0.0001, 0.0005, 0.001, 
0.005, 0.01, 0.025, 0.05, 0.1 (0.1) 0.9, 0.95, 0.975, 0.99, 0.995, 0.999, 0.9995, 
0.9999 for sample sizes n=2 (1) 20 (2) 40 (10) 100; and (2) a table, to seven 
significant figures or six decimal places, whichever is less accurate, of the co- 
efficients of the sample range w in the exact lower confidence bounds for o for 
the above values of P and n. The exact confidence intervals between bounds 
corresponding to P and 1—P for 1—P=0.995, 0.990, 0.975, 0.950, 0.900 are 
much shorter than the approximate confidence intervals obtained by Leone, 
Rutenberg, and Topp, especially for n small and P small (1—P large). 


2. METHOD OF COMPUTATION OF THE TABLES 


2.1 Percentage Points of the Range. The probability integral of the range 
W for samples of size n for R(u, 1), the rectangular population with mean 4 
and variance 1, may be written (see [4, p. 192]) in the form 


P(W, n) = (W/2V3)""[1 + (n — 1)(1 — W/2¥V3)]. (1) 


Values of P(W, n) were computed from equation (1) for W=0.01 (0.01) 
3.46 (0.001) 3.464 and n=2 (1) 20 (2) 40 (10) 100, using double-precision, 
floating-point arithmetic on the Univac Scientific (ERA 1103A) computer. 
The percentage points of the range corresponding to cumulative probabilities 
P=0.0001, 0.0005, 0.001, 0.005, 0.01, 0.025, 0.05, 0.1 (0.1) 0.9, 0.95, 0.975, 
0.99, 0.995, 0.999, 0.9995, 0.9999 for the above values of n were computed 
from the unrounded table of the probability integral by iterative inverse linear 
interpolation on the ERA 1103A. The resulting percentage points, rounded 
to six decimal places, are shown in Table 1. The percentage points in Table 1 
were checked by verifying that P(W —5X10-’, n) <P and that P(W+5X10~’, 
n)>P, using equation (1). 

2.2 Coefficients of w in Lower Confidence Bounds for co. The coefficients 
of the sample range w in the lower confidence bounds (confidence P) for the 
population standard deviation o were found by taking the reciprocals of the 
unrounded percentage points of the range for the same values of P and n. 
The results, rounded to seven significant figures or six decimal places, which- 
ever is less accurate, are shown in Table 2. The coefficients in Table 2 were 
checked by adding and subtracting 5 in the position immediately to the right 
of that occupied by the least significant digit, taking reciprocals, and then verify- 
ing that the corresponding probabilities, as computed from equation (1), 
bracket the required probability P. 


3. EFFECTIVENESS OF CONFIDENCE INTERVALS 


Leone, Rutenberg, and Topp [5] have defined the effectiveness of confidence 
intervals for ¢ as the ratio of the expected length E(l) of the (exact) intervals 
based on the efficient estimate of o to the expected length E(ly) of the intervals 
under consideration..In the case of the rectangular population, the efficient 
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estimate of o is the one based on the sample range, and hence E(I) is the ex- 
pected length of the exact confidence intervals for which the coefficients of w 
are given in Table 2. The effectiveness of the approximate confidence intervals 
proposed by Leone, Rutenberg, and Topp is the ratio of E(l) to the expected 
length E(l) of the intervals between their approximate lower confidence 
bounds corresponding to cumulative probabilities P and 1—P. Numerical 
values of the effectiveness (per cent) of their approximate confidence intervals 
for 1—P =0.995, 0.990, 0.975, 0.950, 0.900 and n= 10 (10) 100 are as follows: 








n=10 n=20 n=30 n=40 n=! n=60 n=70 n=80 n=90 n=100 





0.995 | 12.94 42.78 49.89 53.04 54. .96 56.76 ‘ 57.79 58.14 
0.990 | 21. 45.78 51.71 54.37 5.85 57.52 ‘ 58.40 58.71 
0.975 | 31.79 49.19 53.63 55.65 -54 58.06 , 58.74 58.97 
0.950 | 38. 50.90 54.25 55.78 .23 57.63 . 58.15 58.33 
0.900 | 42.35 50.82 53.10 54.16 54. -16 55.44 , 55.80 55.93 








It is also interesting to look at the exact confidence associated with the ap- 
proximate confidence bounds given by Leone, Rutenberg, and Topp. For ex- 
ample, using a sample of size 50, they give 0.289268w as the 90 per cent lower 
confidence bound for c. It can be seen from Table 2 that the actual confidence 
associated with this lower bound exceeds 0.995, while the exact 90 per cent 
lower confidence bound for ¢ is 0.291793w. 

The author gratefully acknowledges the assistance of Eugene H. Guthrie, 
who programmed the computations for the computer. 
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(The tables follow on pages 604-609) 





TABLE 1. PERCENTAGE POINTS OF THE RANGE 


for Samples of n from R(y, 1) 
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.3836722 | 3.348680 
.376769 | 3.385012 
-410276 | 3.415379 
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-447562 | 3.449138 
.452451 | 3.453561 
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.867689 
.949350 


-036043 
. 129653 
.189125 
. 234531 


3.402202 


3.422616 
3.435772 
3.446698 
3.451968 
3.458774 
3.460350 
3.462433 








AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


TABLE 1.—(Con?'d.) 











34 36 38 40 50 


| 2.384535 | 2.438357 | 2.487122 
| 2.521520 | 2.569834 | 2.613471 
2.583586 | 2.629264 | 2.670463 
2.735934 | 2.774783 | 2.809711 
2.805626 | 2.841189 | 2.873118 
2.902325 | 2.933167 | 2.960805 
0500 2.979649 | 3.006584 | 3.030686 





.531503 
.653075 
. 707809 
.841282 
.901942 
. 985713 
.052379 


.572058 
.689175 
.741814 
- 869955 
. 928091 
.008275 
.072006 


. 731610 
-830392 
.874506 
. 981192 
.029275 
-095280 
. 147489 


SCnNnwN ND WY 
OwWnNNN Ww bd 
oo Gh & bo dw bw be 


122726 
198209 
. 245914 
. 282210 
312460 
339243 
364190 
388770 
416317 


. 139179 
-211172 
. 256627 
.291188 
.319978 
.345456 
. 369180 
. 392546 
417771 


. 1000 3.061590 .084262 . 104519 
. 2000 3.149904 . 167844 . 183846 
.3000 3.205923 .220788 | 3.234035 
-4000 3.248649 . 261132 . 272248 
.5000 3.284328 . 294798 .304115 
.6000 3.315967 . 324634 . 832342 
. 7000 3.345482 .352451 . 358646 
.8000 3.374604 .379883 . 384574 
. 9000 3.406100 .409536 .412588 


. 202283 
. 260746 
. 297523 
.325417 
. 348608 
.369097 
.888147 
.406883 
-427081 


CSCWwwwwwwww 
ww wewewww wo 
WOwWwwwwwww 


.431427 
.441797 
.450405 
454554 
.459910 
.461150 
462789 


433074 
.442923 
451097 
455036 
.460122 
.461300 
.462856 


. 9500 3.425236 .427545 .429594 
0.9750 3.437564 .439143 .440545 
0.9900 3.447801 .448772 .449634 
0.9950 3.452737 .453415 .454016 
0.9990 3.459112 .459409 .459674 
0.9995 3.460588 .460798 .460984 
0.9999 3.462539 .462632 .462715 

P\n 60 70 80 ¢ 100 
0.0001 . 842924 .924944 
0.0005 .928185 .999877 
0.0010 . 966099 .033098 
0.0050 .057391 . 112840 
0.0100 .098356 . 148511 
0.0250 .154414 .197214 

.0500 . 198613 . 235527 


.439320 
.447192 
.453721 
.456866 
.460926 
-461866 
.463107 


3 
3 
3 
3. 
3 
3 
3. 


| oo oo eo Ge eo Go OD 
wwwawwe 





| 





. 987868 
.054674 
.084225 
. 154993 
. 186575 
. 229624 
. 263431 


037659 
.097913 
. 124518 
. 188117 
. 216449 
.255017 
. 285264 


078035 
132898 
157089 
214833 
240519 
.275449 
.302814 


wowwwn nsw 
wOwwwwwrn 
Co 09 8 Go Oo ow | 
ow wwwww 


. 298685 
. 336050 
. 359426 
.377091 
.391734 
.404640 
.416613 
.428365 
.441005 


.316769 
.350121 
. 370964 
. 386704 
.399745 
.411234 
.421887 
.432340 
.443578 


. 331290 
.361405 
.380211 
.394404 
.406159 
.416510 
-426107 
-435519 
-445636 


. 1000 
. 2000 
. 3000 
.4000 
.5000 
.6000 
. 7000 
.8000 
. 9000 


. 244868 .275540 
. 294075 .318018 
.324953 .344627 
348334 .364751 
.367748 .381446 
. 384882 .396169 
.400796 .409834 
.416434 .423253 
.433276 .437695 


CHwwwwwwww 
CWwWWwWWwWW WW WwW W 
CwWwwwwwww 
wCOwwwwwww w 


.448651 
.453564 
.457634 
.459595 
462124 
462709 
463483 


-450374 
.454739 
.458356 
-460098 
462345 
462865 
463552 


.443473 .446434 
.450028 .452050 
.455463 .456705 
.458081 458947 
.461459 .461839 
.462241 .462509 
.463274 .463393 


.9500 
.9750 
. 9900 
.9950 
. 9990 


-451752 
-455679 
-458933 
.460500 
.462521 
-462989 
-463607 





wowwwwww 
CwWwwwwww 
Ow WwW Ww Ww w 
wow ww Ww Ww & 























TABLE 2. COEFFICIENTS OF w IN EXACT LOWER 
CONFIDENCE BOUNDS FOR e 


(Confidence P) for Samples of n from R(u, o) 











2 


3 


4 


5 


6 


7 





-0100 
0250 
0500 


cococco 


1000 
2000 
.3000 
-4000 
-5000 
-6000 
. 7000 
.8000 
. 9000 


ceoecoococococco 


. 9500 
.9750 
. 9900 
. 9950 
.9990 
.9995 
.9999 


0 
0 
0 
0 
0 
0 
0 


P\n 


-0001 
.0005 
.0010 
.0050 
-0100 
.0250 
0500 


coco 


cocec 


. 1000 
. 2000 
. 3000 
-4000 
.5000 
-6000 
. 7000 
.8000 
9000 


eceeoeocoocco 


-9500 
.9750 
-9900 
9950 
.9990 


coeocco 


5773 .358 
1154 .556 
577 .2059 
115.3255 
57 .59033 
22 .94876 
11.40082 


.625364 
. 734370 
. 767327 
. 280705 
. 985599 
. 785416 
.638270 
. 522218 
.422180 


.371816 
342891 
.3820750 
.310641 
. 298162 
. 295278 
.291591 


8 


49 .90350 
22 .26383 
15.71427 
6.972804 
4.900845 
3.061264 
2.132799 


.474336 
-005344 
. 794685 
.666792 
.577350 
- 509065 
.453362 
.404954 
. 358959 


ccocr- 


coco 


.833864 
-318731 
.306743 
.301142 
. 294077 
292467 
290355 


cocococeo 


9.799300 
5.699450 
4.507863 
.603374 
.049266 
-487093 
. 161182 


— e DO bO 


. 900813 
.§91361 
-587221 
. 519667 
.469946 
.430323 
.396741 
. 366486 
. 336671 


ceoeococooeoeooceo 


.319901 
. 309600 
.301331 
- 297433 
. 292484 
.291351 
. 289862 


eooeccco 


. 256924 
.825874 
. 365921 
-559586 
.299915 
.017960 
.842622 


ork eK wh eS 


.693748 
. 566243 
-499445 
.454596 
-420693 
393059 
369162 
347219 
325171 


cooocococceccoceo 


312568 
.304749 
298428 
295435 
291621 
. 290746 
. 289594 


| ssesess 


.555358 
.836760 
.691495 
- 136549 
. 980842 
.804635 
.690286 


ccocorr Kw 


. 689513 
-499825 
.451307 
.418033 
. 392462 
371326 
352819 
335622 
318133 


cooccocco 


. 808035 
.801732 
- 296613 
. 294182 
.291078 
290365 
. 289425 


ooocococo 


-808319 
.370956 
215585 
-916172 
.809440 
-685239 
.602289 


cococcrH- 


.527323 
-458842 
-420983 
394641 
374169 
357087 
.342000 
.327868 
.313377 


coocoococo 


- 804952 
. 299671 
295370 
- 293323 
. 290705 
- 290102 
289309 


eceoocooecco 





9 


10 


11 


12 


13 








.407938 
. 109082 
. 999832 
783415 
703986 
609675 
545369 


cocooere | 


.486186 
.431093 
-400153 
. 378400 
.361355 
.347034 
-334307 
522316 
309945 


ceoecocecoococo 


.802718 
298173 
. 294464 
292696 
290432 
- 289911 
- 289224 





eoeoecoco 


. 164604 
.944288 
.861999 
.695552 
.633106 
. 557823 
.505681 


cocococec 


457026 
411083 
384973 
366469 
351880 
.339558 
328556 
.318144 
. 307353 


cooocccococo 


301024 
297035 
-293774 
292218 
290224 
289765 
289159 





ceocoococo 





| 


.003474 
.832229 
. 767188 
-633458 
. 582420 
.520150 
-476490 


cooocccre 


-435307 
.895981 
.373422 
.357334 
.344589 
.333780 
. 324093 
.314893 
. 305324 


cooceocococoooco 


0.299695 
.296141 
- 293231 
. 291842 
. 290060 
- 289649 
- 289108 


.889969 
. 751622 
.698365 
. 687412 
.544481 
.491596 
.454148 


cooceoco 


.418517 
384184 
364340 
350118 
338806 
329181 
320529 
.312289 
. 303693 


coocoocococooco 


- 298625 
. 295420 
. 292793 
. 291538 
. 289928 
. 289556 


coeococco 





0.289066 





806206 
-691119 
.646325 
.551988 
.515071 
-469235 
.436514 


coocecceo 


.405157 
.374716 
.857013 
.344273 
.334107 
.825433 
317617 
.310156 
. 302354 


eooococoocooco 


-297744 
- 294826 
. 292432 
291287 
- 289818 
.289479 
0.289032 


coococceo 





. 742117 
.644174 
-605700 
.523935 
-491635 
-451265 
.422249 


coococo 


, 394276 
- 366952 
.350978 
. 339442 
.330213 
322320 
-315193 
. 308376 
. 301234 


oeoeoeoccoccoc & 


0.297007 
0.204328 
0.292129 
0.291077 
0.289726 
0.289415 
0.289004 
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14 15 16 17 18 





.691649 
-606767 
.573164 
.501195 
.472537 
-436517 
.0500 .410478 


.650964 
. 576308 
. 546554 
.482405 
-456685 
.424201 
.400602 


-617523 | 0.589584 
.551054 .529794 
. 524408 .505704 
.466628 .453199 
.443323 -431911 
-413765 .404811 
.392199 . 384964 


-565914 
.511663 
-489705 
441635 
422054 
397046 
378670 


.545620 
.496025 
.475872 
431575 
.413457 
. 390250 
373146 


ooococo 
ooococo 
ooooco 

cososos 


. 1000 
. 2000 
.3000 
-4000 
.5000 
.6000 
).7000 
.8000 
.9000 


Oo 


356286 
339432 
329389 
322042 
316105 
310984 
. 306325 
.301836 
. 297098 


385247 
360469 
345921 
335384 
. 326934 
.319693 
.313144 
. 306869 
. 300284 


377634 
.354977 
.341622 
.331926 
.324134 
.317447 
.311389 
.305576 
. 299467 


.371130 
. 350263 
. 337923 
. 328944 
.321716 
.315504 
. 309869 
304455 
. 298758 


. 3865509 
.346174 
.334707 
.326347 
.319607 
.313807 
. 308539 
. 803473 
. 298136 


360604 
.342594 
.331885 
. 324064 
.317751 
.312312 
.307367 
. 302607 
. 297587 


= 
~ 


ooocococ oe 
oooooosso 


. 9500 
.9750 
.9900 
.9950 
.9990 
.9995 
.9999 


. 294276 
.292481 
. 291003 
- 290295 
- 289385 
. 289174 
. 288897 


296381 
- 293905 
. 291872 
. 290898 
. 289648 
. 289360 
- 288979 


. 295842 
. 293541 
. 291650 
. 290744 
- 289581 
- 289313 
- 288958 


295374 
. 293224 
. 291457 
290610 
. 289522 
. 289271 
288940 


. 294963 
- 292946 
. 291287 
. 290492 
. 289471 
. 289235 
- 288924 


. 294600 
. 292700 
291137 
. 290388 
. 289425 
. 289203 
. 288910 


eocoococo 
ooocooo 
cooocoococo 
coococo 





P\n 20 22 24 26 28 30 





.0001 
.0005 
.0010 
-0050 
-0100 
.0250 


.442816 
.415362 
-403949 
.378283 
.367537 
.353542 
.343051 


.430137 
-405229 
.394841 
.371406 
. 361562 
.348713 
.3839058 


. 528037 
-482405 
.463796 
-422745 
-405893 
. 384251 
-368258 


499115 
.459851 
.443739 
.407978 
-393205 
.374147 
.359999 


-476339 
.441949 
.427766 
.396124 
. 382983 
.365968 
.353289 


.457955 
427406 
.414752 
. 386400 
.374574 
.359214 
.347731 


ooococco 
cocooco 
eooococo 
ooococo 
ooeococo 


.352456 
.336621 
.327165 
.320239 
.314636 
.3809798 
. 305394 
.301146 
- 296659 


.345963 
.331838 
.323374 
.317160 
.312123 
.307768 
.303797 
- 299963 
. 295907 


.340668 
.327922 
.320263 
.314628 
.310054 
- 306094 
302479 
- 298985 
295284 


.336268 
.324658 
.317663 
.312509 
. 308320 
304690 
.3801372 
. 298163 
. 294760 


.332553 
.321894 
.315459 
.310710 
. 306847 
.303495 
. 300430 
. 297462 
. 294313 


.329376 
.319524 
.313566 
.309164 
.305579 
. 302466 
. 299618 
. 296858 
. 293927 


cooococoocoocco 
oooocoocococo 
eeesssessse 
oeoocooocoo 
cocoocooco 


. 293986 
292284 
- 290883 
290212 
. 289348 
. 289149 
. 288886 


- 293488 
.291947 
- 290677 
- 290068 
289286 
289105 
288866 


. 293075 
. 291667 
. 290506 


. 292727 
-291431 
. 290362 
. 289950 . 289849 
. 289234 . 289190 
-289068 | 0.289037 
-288850 | 0.288836 


. 292430 
291229 
. 290239 
. 289764 
289152 
. 289011 
. 288824 


. 292174 
291055 
. 290133 
- 289690 
. 289120 
. 288988 
. 288814 


eossessce 
ooocoocooco 
oooco 





























USE OF SAMPLE RANGES IN SETTING CONFIDENCE BOUNDS 


TABLE 2.—(Cont’d.) 





34 


36 


38 


40 


50 





coo 


ooococcco 


eoooceoocco 


-0005 
.0010 
.0050 
-0100 
.0250 
.0500 


. 1000 
. 2000 
.3000 
.4000 
.5000 


6000 
7000 
8000 
9000 


9500 
9750 
9900 


. 9950 
.9990 
.9995 
.9999 


-0500 


. 1000 
. 2000 
. 3000 
.4000 
. 5000 
.6000 
. 7000 
.8000 
. 9000 


9500 
9750 
.9900 
9950 
9990 
.9995 
.9999° 





0.419369 
0.396586 
0.387059 
0.365506 
0.356427 
0.344551 
0.335610 


. 326628 
.317470 
.311923 
.307820 
. 304476 
.301571 
. 298911 
. 296331 
- 293591 


.291951 
. 290904 
- 290040 
. 289625 
. 289092 
. 288968 
. 288805 


coococooco 


ooocococeo 


ooocococo 


.410112 
. 389130 
.380335 
. 360389 
.351965 


340928 
332603 


-324227 
-315672 
-310483 
- 306642 
.303509 
.300785 
- 298289 
. 295868 
- 293295 


-291754 
.290770 
- 289958 
- 289568 
- 289067 
- 288951 
- 288798 


402071 
382633 
374467 
355908 
348054 
.337746 
.329958 


coococco 


-322111 
. 314086 
. 309211 
. 305600 
. 302653 
. 300089 
.297739 
. 295458 
. 293033 


eceoocooccoco 


. 291580 
. 290652 
. 289886 
. 289518 
. 289045 
288935 


|; ecoococe 


. 288791 


. 395022 
.376921 
. 369302 
.351954 
.344597 
.334928 
.327613 


0.320233 
0.312675 
0.308080 
.304673 
.301890 
. 299469 
. 297248 
. 295092 
292799 


cocococo 


. 291424 
-290546 
. 289821 
. 289473 
- 289025 
288921 
288785 


coocococo 


. 388794 
371861 
. 364722 
. 348438 
.341519 
.332416 
.325520 


ooococeo 


.318555 
311413 
. 307066 
303842 
.301207 
.298913 
296808 
294764 
292588 


ecooocecoooco 


.291284 
290451 
289763 
289433 
289007 
288909 
- 288779 


coococo 


0.366084 
0.353308 
0.347886 
0.335436 
0.330112 
0.323073 
0.317714 


0.312277 


.291793 


. 290755 
. 290091 
289543 
. 28927 

. 288940 
. 288862 
288758 








.351751 
.341509 
.337143 
.327076 
.322752 
.317016 
.312636 


.308179 
.303575 
. 300756 
. 298656 
. 296934 
295431 
. 294049 
. 292703 
. 291267 


. 290404 
- 289853 
- 289397 
.289178 
. 288896 
- 288830 
. 288744 


cooocooco 





70 





80 


90 


100 





.341887 
333347 
. 329696 
.321250 


0.317610 


.312772 
. 309069 


. 305293 
.301385 
. 298987 
. 297199 
. 295731 
. 294449 
. 293269 
- 292120 
. 290893 


-290155 
. 289683 
. 289293 





. 334687 
.327367 
.324231 
.316958 
-313817 
.3809634 
. 306426 


ooococococococo 


.303151 
. 299756 
. 297670 
. 296113 
. 294834 
-293717 
- 292687 
- 291684 
- 290613 


coceoocooco 


. 289968 
289556 
- 289215 
- 289051 
288840 
0.288791 
0.288727 


ooocococo 





.329201 
. 3822798 
.320049 
-313665 
310902 
307218 
304390 


ooeocccoco 


301498 
. 298497 
. 296651 
- 295272 
294140 
. 293149 
- 292236 
-291346 
0.290396 


coocoooo 


0.289824 
0.289457 
0.289155 
0.289009 
0.288822 
0.288778 
0.288721 





-324883 
-319193 
.316747 
.311058 
. 308593 
-305302 
302772 


ooocoocooco 


300184 
-297495 
. 295840 
294603 
293586 
292696 
291877 
291077 
290222 


ooocooococ o 


- 289708 
289379 
- 289106 
0.288976 
0.288807 
0.288768 
0.288716 











BIAS IN PSEUDO-RANDOM NUMBERS 


Paut Preacu 
System Development Corporation 


Some congruential pseudo-random number generators are shown to 
be subject to sub-periods or harmonics whose effect is to constrain the 
variability of the numbers generated. An experiment with such a gen- 
erator produced a long sequence whose variance was significantly less 
than the theoretical value. 


1. INTRODUCTION 


HE simulation of large systems through digital computer programs is 

becoming a commonplace. Most such systems are subject to random 
inputs, and to simulate these there have been proposed formulas, capable of 
being programmed, called pseudo-random number generators [4, p. 8] [5, p. 19] 
(7, p. 75] [8, pp. 15-28]. Numbers generated by a formula cannot of course be 
random, but their use is almost a necessity in the computers now generally 
available. 

The formulas most widely accepted are those of the “congruential” type. In 
these we may have a multiplication, an addition, and a division, the output 
being the remainder after division. One such formula, discussed by Rotenberg 
(7, p. 75] and Coveyou [3, p. 72] is 


Xnu1 =’Xn+ pu Mod 2¥ (1.1) 


where “Mod 2”” with the congruence sign = means that \X,-+4u is divided 
by 2” and X,4: is the remainder.* 

The parameter J is of the form (2?+1) where P is an integer greater than 1; 
u may be any odd number, and M any positive integer. We start with any con- 
venient Xo; the formula then generates all the integers from 0 to (2“—1). The 
proof is usually presented in number-theoretic terms, but such a presentation 
is not necessary ; Section 5 gives a proof that uses high school mathematics only. 

To illustrate the formula, suppose we take P=2, M=3, Xo=0, »=5. The 
formula then becomes 


Xavi = 5(X, +1) Mod8 


whence X,=5, X2=6, X3=3, X4=4, Xs=1, Xe=2, X7=7, Xs=0. 

The sequence then repeats itself. It can be shown that (1.1) has the period 
2”; most of its properties have been discussed in other publications [1, pp. 
261-5] [2, p. 353] [3, pp. 72-4] [7, p. 75]. Since M can be chosen large, 
limited only by the word-length of the computer—for example, on the Bendix 
G-15, using single-precision only, we can take 1/ =28—the period can be long; 
2° = 268,435,456. 

A good feature of congruential formulas (which has been pointed out many 
times) is that they can avoid the possibility of degenerating into a small loop. 





* The symbol “Mod 2™” is from number theory. Since most people who take courses in number theory forget 
it as soon as they have passed their examinations, and since the author would like to keep this paper as accessible 
as possible, the discussion will be in terms of simple algebra. ' 
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An early method, sometimes called the mid-square method, is subject to this 
danger. In the mid-square method we square an n-digit number and extract 
the middle digits, squaring these again, and so on. For example: suppose we 
start with 3712. 


Xo = 3712 Xo? = 13778944 X, = 7789 
X,? = 60668521 Xs = 6685 


and so on. We get X;=6892, X,=4996, X;=9600. Now the series goes into a 
loop: 9600, 1600, 5600, 3600, and back to 9600. This cannot happen with (1.1). 


2. STATISTICAL TESTS OF PSEUDO-RANDOM NUMBERS 


Number generators are commonly validated by showing that their outputs 
pass certain statistical tests. For example, Rotenberg [7, p. 75] using (1.1) with 
P=7, u=1, M=35, has examined 4096 numbers, presumably consecutive, 
tabulated class frequencies, and calculated the mean, variance, serial correla- 
tion, and frequencies of runs. His observed frequencies (in 8 classes) are 505, 
540, 511, 510, 481, 514, 508, 527; the expected value is 512, standard deviation 
21.166. He does not calculate chi-square; it comes out almost exactly 4, which 
for 7 degrees of freedom will be exceeded by chance about 78 per cent of the 
time, and so would not be taken as significantly small. 

Since (1.1) generates all the integers from 0 to (2%—1), it follows that for a 
sufficiently long sequence the mean and variance must agree exactly with the 
theoretical values for a uniformly distributed population. In Section 5 it is 
shown that the sequence has not only the period 2”, but also sub-periods or 
harmonics of all lengths 2‘, 0<i<M. These harmonics are subject to definite 
patterns; they constitute music rather than noise. For example consider the 
series generated by 


Xny1 = 9X, +13 Mod 32. 
It is 
SB 2a 0242.3: 3: Fes Se ae ee 
16 29 18 15 20 1 22 19 2 5 26 23 28 9 30 27 
0 13 


The first two lines comprise the two halves of a period, and we notice that 
corresponding numbers in the two half-periods always differ by exactly 16. 
Whatever the length of the period, the difference between corresponding num- 
bers of the half-period is 2¥—'. If we consider quarter-periods, the difference be- 
tween corresponding numbers is always a multiple of 2”~*. This relationship 
holds whatever the values of P and y; it is an inherent property of (1.1). 

The presence of these harmonics creates a problem that does not seem to 
have been recognized until now. For large M we are relying essentially on the 
hope that the higher harmonics will produce a discord undistinguishable from 
noise, Whether patterns introduced by the harmonics are objectionable or not 
depends, of course, on the application. If our purpose is to estimate an unknown 
mean without bias, periodicities may do no harm; even if the events to be ob- 
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served are themselves periodic, it does not necessarily follow that observing 
them at regular intervals introduces a bias (though obviously it may). The 
writer used (1.1) to estimate transients in a queuing problem [6, pp. 261-70] 
and obtained good agreement with theoretical results. It was in fact precisely 
the goodness of the agreement that led to the study described in this paper. 

In a large class of problems, we shall be interested in variances as well as 
means. Thus, in a queuing problem we might wish to know, not only the mean 
queue length, but also the probability of exceeding this mean length by chance 
by some critical amount within a given time. If the results of simulation are 
used as guides in deciding how much capacity to provide, an estimate derived 
by using periodic sequences may lead to under-design and subsequent failure. 
The point seemed important enough to justify an experiment. 


3. DESCRIPTION OF EXPERIMENT 


If the numbers generated by (1.1) can be used in place of random numbers, 
they must fall into frequency classes in agreement with multinomial sta- 
tistics. The fact that expected values are realized is not sufficient; undue uni- 
formity here would be as bad as excessive variability. The null hypothesis 
adopted is that the distributions are not unduly uniform; this is in keeping with 
Rotenberg’s results, which gave a value of chi-square less than expectation. The 
alternate hypothesis is that distributions are too uniform (constrained by the 
harmonics) and if this is true we should get a significantly small value of chi- 
square. 

Rotenberg’s sample of 4096 did not give a significantly small chi-square, so 
this experiment was based on two longer sequence lengths: 8192 and 32,768. 
Numbers were generated on a Bendix G-15, using (1.1), with 


P=11 
uw = 211,527,139 = .788 K 2” 


following Coveyou [3, p. 74] and Rotenberg [7, p. 75]. 
X, = 0 (not counted). 


As soon as the program generated a number it tested the three leading (high- 
order) binary digits and tallied the numbers in memory. When the sample was 
complete the totals were typed out, the tallies were reset to zero, and then the 
process continued from the last number generated. In this way 262,144 con- 
secutive numbers were generated and tallied, the first half in groups of 8192, 
the second in groups of 32,768. After the program had completed the first 11 
samples it was modified to type out the final random number counted in each 
sample; the first two samples were then recalculated as a check. The resulting 
12 numbers have some interest of their own, and are presented in the data. It 
is not hard to fill in the values for samples 3-10 inclusive. 


4. DISCUSSION OF THE DATA 


The results are a bit frustrating, though of course there is no doubt about 
their significance. For 7 degrees of freedom chi-square exceeds 2.17 95 per cent 
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of the time. This level of significance is not attained in any of the large samples, 
but it is attained by 3 small ones. If we add up the 16 values of chi-square for 
the small samples we get a total of 79.6; a value as small as this for 112 D. F. 
could be attained only about 2 per cent of the time. For the four long samples 
we get a total chi-square of 14.0, which for 28 D. F. has about the same proba- 
bility. 

The frustrations are two. First: it has taken a number of rather large samples 
to demonstrate the bias in the routine. The values of chi-square tend to be 
small, but not so small as to overwhelm the observer. Second: if we combine all 
262,144 random numbers in 8 classes for a single test with 7 D.F. we get chi- 
square equal to 6.02, quite close to the 50 per cent point (=6.3). This means 
that the bias, whatever its cause, may become obscured if the sample is big 
enough. Of course, we cannot assert that this outcome would be repeated in 
another experiment. It is interesting to note that in Table 1, several of the ob- 
served frequencies deviate rather markedly from the expected value, the devia- 
tion for Group 6 being 2.5 times the standard deviation; the probability of such 
a deviation is about .012, but of course the data cannot be used to test a hy- 
pothesis suggested by the data. 


5. PROPERTIES OF FORMULA 1.1 


In the following treatment X; is defined as an equality rather than a congru- 
ence, in order to make the proofs accessible to readers not at home in number 
theory. 

Consider the special case (Formula 1.1) 


Xn = AX, + 1 
with X,>=0, \=2?+1, P>1 (integer) 


$(A" + 1) is an odd number for all n. 


$(A" + 1) = 3([2? + 1)" + 1) 
= 4((1 + terms in 2?] + 1) 
= $(2 + terms in 2”) 
= 1 + termsin2?-', oddif P > 1. 


We note that (A*+1) contains 2 but no higher power of 2. 


X,=14+A+$M+--+ $2" 
By induction. 
Let s=2°, Q any positive integer. Then X, contains 2° but not 2°¢*'. 
By induction. Assume it true for some Q. Then 


Xu = 1+ A$ +++ FATE MLE AH ++ $A) 
= (1+*)X,. 
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By (5.1), 1+)* contains 2 exactly once; then if X, contains 2? but not 2¢+, 
X2, contains 2¢+! but not 2°**. But (5.3) is true if Q=1, since X,=2?+2. 


If n is odd, X, is odd. (5.4) 


We note that the X, are alternately even and odd. 


If 0<n<2°@, X, does not contain 22. (5.5) 


If n is odd, this follows from (5.4), and if n=2¥ it follows from (5.3). Otherwise 
n has the form R-24, where RF is an odd number. Then X, contains Xz and we 
can take out exactly L factors of the form (A*+1); whence X, contains 2” but 
not 24+', 


If Z, is the remainder when X, is divided by s=2™ then the sequence 
Z:, Zo, Zs, - + - Z,-1 includes ali the integers from 1 to s—1: that is, 
every possible remainder occurs exactly once. (5.6) 


By (5.5) we know that there is a remainder; ie., Z;#0 if 1<i<s. Assume 
s>j>i. If Z;=Z;, then X;—X; must contain s. But 
X; — X; = + dH 4 - - - + dy (6.2) 
=A(14+A+---+A%*) 
= AiX;_;. 
This cannot contain 2”, by (5.5): thus, Z;#Z; and no remainder occurs more 
than once. Since (s—1) remainders occur, (5.6) is proved. 


Z; _= Lite (5.7) 


The Z,; form a periodic sequence with period 2™. (5.8) 
If 


|¢ —j| contains 2°, so does | Z; — Z;|. (5.9) 


Xazi = AX, + 1 
0 < Xo < 2™. 
By (5.6), Xo is equal to some Z, say Zx. Then 
Ze = Xo=1+A4+°--+A*'—h-2¥ 
Xi =1+A+-°-++A* —Mh-2” 
Xp. =1+A+-+-- + AXt — \M-D™ 


and so on. The values of Zx41, Zx42, - + + are not affected by the terms in 2™. 
Thus, (5.6) remains true for arbitrary Xo. 
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Let 
Xnzi =AXa ty 
0 < Xe < 2, nw any odd number. 
Then 
Ze =Xo=wltaAt---+A*) —h-2¥ 
X, =pw(L+X+--+ +)%) — h-2™ 
and so on. If Z;=Z,;, X;— X; must contain 2”, that is, 


u(rAt + di +... + A) = 


wl +A+--->+AFH}) = 


Since pd‘ is odd, this is possible only if 7—7¢ contains 2”. Thus, if 7, 7<2™, 
Z;~Z; (unless of course i=j). This shows that no Z is repeated. But Z=0 if 
and only if K=2”—1. Thus we can state 


Theorem I. 


If \=2?+1 P>1 (integer) 
0<X_9<2” M>0 (integer) 
w=any odd number 


then the formula (1.1) X,4:=AX,+u Mod 2” generates a set of identical se- 
quences of length 2”, each containing all the integers 0 to 2” —1. 


If Xo>=0 and X,4:=AX,+u, then three equally spaced values X,, 
Xn4K, Xnsex cannot be in arithmetic progression Mod 2” if Kisodd. (5.12) 


If X,, Xnix, Xn4ex are in arithmetical progression modulo 2”, then 


Xn — Ske + XE, = ho" 


A"(AK — 1)Xx = h-2”. 


If K is odd, \* —1 contains 2” but no higher power of 2; Xx and \” are both odd. 


If K=2°2, then X,, Xnsx, Xn4ox will be in arithmetic progression modulo 
2™ if 20> M—P. So also if K is a multiple of 22. (5.13) 


If K =2%, 29> M-—P, then 
Xyix — Xv =D Mod 2¥ 


where D is a constant, the same for all N. 
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Sample 
No. 


Group 





1,010 
1,042 
1,026 
1,016 
1,036 
1,026 
1,021 
1,020 
1,021 
1,025 
1,077 
1,002 
1,005 
1,025 

989 


997 


5 


7 


S? 


tad 





1,003 
1,030 
997 
988 
1,055 


1,024 
981 
1,010 
1,023 
980 
1,028 
1,020 
1,022 
1,040 
1,040 
974 
976 
1,045 
1,033 
1,018 
988 


961 

979 
1,079 
1,027 
1,041 
1,003 

981 
1,019 
1,032 
1,053 
1,014 
1,070 
1,038 
1,018 
1,062 
1,034 


7,730 
5,610 
4,620 
12,774 


[se KF OR WN ROS 


- Oo te 
oar wmnwes 


wom to 


81,478 57 





Totals 16,338 16,467 16,399 16,382 16,190 16,202 16,683 16,411 





Devia- —46 +83 +15 -2 —1% —182 +299 


2? =170,124 
x?=10.4 


+27 


tions* 








* From the theoretical mean of 16,384. The standard deviation on a binomial hypothesis is 119.7 plus. 


6. CONCLUSIONS 


The pseudo-random numbers generated by (1.1) contain patterns and 
periodicities that act as constraints upon their variability. Their use in simula- 
tion is likely to lead to an appearance in the simulated system of greater sta- 
bility than the real system will have. Any results based on a simulation using 
(1.1) should be taken as probably good predictions of average outcomes, but 
poor indications of possible extreme conditions. 

There is evidence that P should be kept smail. Aside from reducing computer 


TABLE 2. FREQUENCY COUNTS FOR SAMPLES OF 32,768 








Sample Group 


No. 


0 1 5 








4,164 4,062 
4,064 
4,134 


4,174 


4,072 
4,071 
4,081 


4,075 4,157 





16 ,434 





Totals 16,296 16,449 


16 ,593 16 ,381 





Deviations +209 


+50 —88 


+65 —46 -3 77 ,940 





Totals & Deviations for All Samples 





32,931 
+163 


32,901 
+133 


32 ,695 
—73 


32,831 
+63 


32,528 
—240 


32,583 
—185 


32 ,983 
+215 


32 ,692 


—76 197,382 
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TABLE 3. PSEUDO-RANDOM NUMBERS 











N xX N 





54,288,384 
16,384 108,576,768 
90,112 60,301,312 
98,304 114,589,696 

106,496 168 ,878 ,080 

114,688 223, 166,464 

122,880 9,019,392 

131,072 63,307,776 

163,840 12,025,856 

196, 608 229,179,392 

229,376 177,897 ,472 

262,144 126 615,552 








From (5.13) we know that with M=28, P=11, the resulting pseudo-random numbers will be in arithmetic 
progression Mod 2” at intervals of 512: that is, Xn, Xnix, Xn42K,* * * , Will be in arithmetic progression if K =512 
or any multiple of 512, 


time, small values of P increase the length of the cycle for arithmetic progres- 
sion. Note that if P= M, the effect is as if P were zero; and if P= M—2, then 
by (5.13) alternate terms are in arithmetical progression. A study of Table 4 is 
illuminating. 

It must be obvious that if (1.1) is used and only some of the digits of the X’s 
are kept, the highest order digits should be the ones retained. To discard any 
of these merely has the effect of reducing M. 


TABLE 4 


Xap @AXat1 Mod 2!2 (= 4096) 
Xo=0 A=65 (P =6) 


The first 72 values in groups of 8, reading down the columns, are 








Column No. 





2 3 4 5 6 7 





808 2,608 
3,369 1,585 
1,898 626 

491 3,827 
3,244 2,996 
1,965 2,229 

750 1,526 
3,695 887 


0 1,800 3,600 1,304 

1 2,313 529 2,841 

66 2,890 1,618 346 
195 3,531 2,771 2,011 
388 140 3,988 3,740 
645 909 1,173 1,437 
966 1,742 2,518 3,294 
1,351 2,639 3,927 1,119 


no 


noe eR Oe WO = 
ROT R NK OO 








Note that (a) the numbers on any one line are in arithmetical progression Mod 4096, with common difference 
264 +-512k, where k takes on the values 3 (line 1), 4, 5, 6, 7, 0, 1, 2 (line 8); and 
(b) if j =64, then Xj,;—X; =2112 for all i. 
(c) if 7 =32, then X;4; -X; =3104 if ¢ is even. 
(d) if 7 =16, then X;,; —X; =3600 if ¢ contains 4. 
Other periodicities can be found. 
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7. SUGGESTIONS 


Since pseudo-random number generators are fundamental tools of simula- 
tion, some effort should be made to discover formulas with better properties 
than (1.1). The formula 


Xnui = AX, + K+n Mod 2” 


has been examined briefly, but preliminary results do not look promising. A 
possible modification is 


Xn41 = AX, + K+ L Mod 2” 


where L goes through some cycle with a period not a multiple of 2. Finally, the 
effect of making P small (2 is the minimum value) should be investigated. 
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A NOTE ON THE EXACT FINITE SAMPLE FREQUENCY 
FUNCTIONS OF GENERALIZED CLASSICAL LINEAR 
ESTIMATORS IN TWO LEADING OVER- 
IDENTIFIED CASES* 


R. L. BasMANNt 
General Electric Company 


The asymptotic unbiasedness and normality of alternative statistical 
estimators @, 0,| - + - of a given parameter 6* are generally proved with- 
out reference to any explicit knowledge of the exact finite sample dis- 
tribution functions, F(z), Ga(z) +++. (Here n denotes sample size.) 
Within the class of asymptotically unbiased and normally distributed 
estimators of a given parameter it is sometimes possible to demonstrate 
that one estimator possesses a smaller asymptotic variance than an- 
other, or that one estimator possesses the smallest asymptotic variance 
within a particular subclass. Asymptotic theory obviously does not 
predict anything about finite sample distribution functions F,(z), 
G(x) - + +. In particular we cannot deduce from asymptotic theorems 
that the estimator with the smallest asymptotic variance will continue 
to exhibit the smallest dispersion in finite samples. Consequently it 
remains an essential task in positive estimation theory to derive the 
exact finite sample distribution functions of the alternative estimators 
that appear to be promising on the basis of asymptotic considerations. 

In econometric statistics the alternative estimators of structural 
coefficients in systems of simultaneous equations appear to possess 
rather complicated finite sample distribution functions. Monte Carlo 
experiments are resorted to for the purpose of providing useful leads in 
mathematical research into the nature of these distributions. In this 
article are presented some leading results of a mathematical investiga- 
tion of the exact finite sample distribution functions of generalized 
classical linear estimators. The results presented here exhibit some par- 
ticularly important implications for the conduct and evaluation of 
Monte Carlo experiments in econometric statistics. 


1. INTRODUCTION 


AMPLING experiments that utilize Monte Carlo techniques have been widely 
S applied in the study of the finite sample distributions of several alternative 
structural coefficient estimators employed in econometric statistics, namely, 
the so-called full information estimators (FI), |17, 19], the limited information 
or least variance ratio estimators (LVR), [1, 2, 17], the generalized classical 
linear estimators (GCL) and Theil’s k-Class estimators, [3, 4, 5, 23, 24, pp. 
334ff |, and the direct least squares estimators (DLS). The use of Monte Carlo 
techniques has been motivated by the understandable desire to avoid the 
difficult task of working out the exact finite sample distributions and frequency 
functions of these alternative estimators. 





* Work performed partly in connection with Project Numbers 1300086 and 1300071, Technical Military 
Planning Operation, General Electric Company, Santa Barbara, California. This topic was first presented in a 
seminar held in the Department of Economics, Purdue University, April 25, 1960. It was also presented in a Special 
Statistical Seminar, Statistical Laboratory, Iowa State University, April 29, 1960. 

t I am indebted to the following persons for valuable criticisms of the original draft of this paper: Professors 
Gerhard Tintner, Arnold Zellner, Stanley Reiter, Mitchell Harwitz, Sudish Ghurye, Martin Shubik and Dr. 
Ernest J. Mosbaek. All errors and omissions remain my own responsibility. 
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In most of the sampling experiments published so far the authors have 
sought to provide experimental support to normative comparisons between 
two or more types of estimator in respect of finite sample bias and mean- 
square deviations, [21, 22; 25, p. 88; 27]. It is generally conceded that a rational 
criterion of choice between two alternative estimators of a given population 
parameter should be formulated in terms of the relative degrees of concentra- 
tion of their distributions about the true parameter point. If the finite sample 
distributions of the GCL, LVR, FI and DLS estimators are at the least 
approximately normally distributed and possess finite variances, then it is 
clear that any rational measures of concentration will depend predominantly 
on the second moments. We must be prepared to find, however, that not any, 
or, at least, not all of these alternative estimators are approximately normally 
distributed in finite samples, nor even possess finite variances. In respect of the 
FI, LVR, GCL and Theil’s k-Class estimators, and in respect of the DLS esti- 
mators, the existence of mathematical expectations defining means and vari- 
ances has been widely accorded uncritical acceptance.' 

Yet, if it is proposed to use the mean-square deviation as a criterion of choice, 
and if it is proposed to employ a Monte Carlo experiment to measure the sup- 
posed means and variances of alternative estimators, then it is essential to 
verify in advance, by mathematical analysis, whether the parameters, e.g., 
means and variances, of the exact distribution functions actually exist and do 
thereby allow the application of the Law of Large Numbers [12, p. 228] to go 
validly through. Suppose u, with range — © <u<-+ ©, is one kind of estimator 
of some population parameter @, and suppose that f(u) is the unknown fre- 
quency function of u. Suppose we construct a Monte Carlo experiment to gen- 
erate a sequence {uz} of mutually independent estimates, wu. If the iniegral 


+00 
M1 -{ uf (u)du (1.1) 


—o 


exists, i.e., ts finite, then, according to the Law of Large Numbers, for every «>0, 


pr{e tert _yl> db 0 (1.2) 





n 


as n— ©. If, however, this integral does not exist, then the average of estimates 
ux does not converge in probability to any finite value; in such a case, it is, of 
course, meaningless to speak of the Monte Carlo experiment providing informa- 
tion about the bias of u, and a fortiori it is meaningless to speak of the Monte 
Carlo experiment providing information about the mean-square deviation of u, 
if bias and mean-square deviation are understood in the usual sense of mathe- 
matical expectations. Fortunately, however, regardless of the non-existence of 
moments, it is possible to test hypotheses about the exact finite sample distribu- 
tion function by means of its sampling experimental counterpart 


F.(z) = © (1.3) 
n 





' See especially the paper by C. F. Christ [10, p. 840]. Professor Christ evaluates the results of some Monte 
Carlo experiments [7, 22, 27] solely by comparing mean-square deviations of the alternative estimators studied. 





SAMPLE FREQUENCIES OF GENERALIZED LINEAR ESTIMATORS 621 


where z is continuous and n(x) denotes the number of estimates 1, that do not 
exceed x, [16; 8] since 


lim F,(z2) = F(z) = f seoau, (1.4) 


no 


The result expressed by equation (1.4) does not require that the distribution 
function F(x) possess any finite moment whatsoever. [6; 7, Sec. 4 pp. 73-88]. 

In order to demonstrate the cogency of the moment existence problem in 
respect to GCL estimation, I shall present here the exact finite sample fre- 
quency function of a GCL coefficient estimator in each of two leading over- 
identified cases. In the first of these cases, the mean of the GCL estimator 
exists but the variance does not. In the second case the variance exists, but the 
higher moments do not.? 

It is appropriate to mention that even though the ezact distribution function 
F(x) of some estimator fails to possess moments of lower order (say) a vari- 
ance, it is still possible in many cases to approximate F(x) by a distribution 
function G(x) that does possess (say) a variance and even possesses moments 
of still higher order. Thus A. L. Nagar has made an important contribution to 
econometric statistics by working out formulas for the bias and moment 
matrices of approximate distributions for Theil’s k-Class estimators, [20]. The 
reader will easily satisfy himself that Nagar’s approximations do not depend on 
the exact distributions possessing a finite variance. Examine the exact frequency 
function exhibited in Figure 1 below. This frequency function does not possess 
a finite variance. Consider the approximation obtained by truncating the exact 
frequency function at the points »,= —3, »,=3. The approximate frequency 
function obtained in this way possesses finite moments of all orders. The ap- 
proximate distribution function will be an excellent one, indeed. 

As has been mentioned already, the use of Monte Carlo techniques for the 
examination of the finite sample distribution functions of GCL, LVR, FI and 
DLS structural coefficient estimators has been prompted by the opinion that 
these distributions are very complicated. The opinion is, I believe, essentially 
sound. For all that, derivation of the GCL frequency function given in Sec. 3 
provides some much needed perspective on the matter and constitutes a 
central topic of this article. 


2. ASSUMPTIONS AND SPECIFICATIONS 


Equations (2.1) and (2.2) afford the simplest typical example of a system of 
simultaneous structural equations that can be estimated by GCL, LVR and FI 
methods with different results. 





2 In an article published in 1932 E. C. Fieller [13] worked out the exact finite sample distribution and fre- 
quency function of the quotient of two non-singular correlated random normal variates with non sero means. 
[13, eqn. (24), p. 432]. The essential feature of the Fieller distribution is that, like the Cauchy distribution to which 
it is closely related, all of its moments are infinite. The Fieller distribution should be of practical interest to econo- 
metricians, for in the leading two-equation, just-identifiable case of structural estimation (26, Ch. 7] where no 
lagged endogenous variables are present, and structural disturbances are normally distributed, the maximum- 
likelihood (ML) estimates of structural coefficients are distributed according to Fieller's law. Consequently, it is 
not possible meaningfully to compare this (ML) estimator with (say) the so-called “direct least squares” (DLS) 
estimator of the same coefficient in respect of the criterion of mean-square deviation. A meaningful comparison 
has to be made in terms of distribution function properties more fundamental than moments. 
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— Yun + Baye + yuta t+ Yute + Nite + yutat yw ten =O (2.1) 
— Yu + Booyer + Yaita + Yo2ete + Yost + Yatut voter =O (2.2 
Boe — Bis ¥ 0. 
The variables yu, ¥ are called endogenous variables and are determined within 
the system of equations (2.1) and (2.2) are supposed to represent. The variables 
tn, ***, 24 are called pre-determined and represent lagged values of endoge- 
nous variables, and variables that are otherwise determined outside of the sys- 
tem represented by (2.1) and (2.2). 
The vectors (en, @2), =1, 2, -- +, n are identically and independently dis- 
tributed, and the joint distribution function of (e,, e2) possesses the following 
finite moments: 


E(eu) =0 
E(eres) = wi, 7,7 = 1,2. 

The random variables e;; and the pre-determined variables zn, - - + , 4 are 
independently distributed. Moreover, if no ry is a lagged endogenous variable, 
then the random variables e;;, i= 1, 2, are distributed independently of z,, where 
sl, 

Estimation is accomplished under the identifiability hypotheses, which are 
like 

Hi:yu = 0, ‘ : v13 ~ 0, yu +0; (2.3) 
H: Y23 = 0, (2 ; Y21 * 0, Y22 ~ 0. (2.4) 

The system (2.1) and (2.2) is represented in its reduced form by 
Ya = uta + tele + wiles + Tule + m0 + 10 (2.5) 


Yer = Tela t Tole + wastes + Weslea + 720 + Ne (2.6) 


where, under (2.3) and (2.4) 


Bote + Birer 
Biz — Boo 


Nt - 


Ct T Cte 


12 - . 
Biz — Boe 


The vectors (nu, m2), t=1, 2, -- +, m consequently are identically and inde- 
pendently distributed with the finite moments 


E(ni) = 0 
E (nen) al | t,J = l1,: 
The random variables ,;, i= 1, 2, and the pre-determined variables x, - - - , t 
are independently distributed. Moreover, if no zx is a lagged endogenous vari- 


able, then the random variables 7;; are distributed independently of x,, where 
st. 


As the reduced-form coefficients rx, i=1, 2; k=1, 2, 3, 4, 0 in (2.5) and (2.6) 
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Fig, 1, Exact Frequency Function of the GCL Estimator vi. 
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are well-defined functions of the structural coefficients in (2.1) and (2.2), the 
following restrictions on the former are implied by the identifiability hy- 
potheses (2.3) and (2.4); 

H, implies, but is not implied by, 


P WiWi2). i 
matrix is of rank p; < 1; 
721722 
not both ro; = 0 and woz = 0. 
H; implies, but is not implied by, 
7 WisTia |. 
matrix | | is of rank p2 < 1; 
7237 24 


not both r23 = 0 and ma, = 0. (2.8b) 


In the sequel we shall assume (because it will ease the reader’s task in follow- 
ing the argument) that 


(2.9) 


moreover, that 


N 


2 Yu = 0, 


t=1 


Mi 
> tn = 0 
t=1 

and that 

=Om+#k 


Vv 
7. LtkX tm 
t= 1 


9 
=im=k. ae 
We assume that none of the x,, k=1, 2, 3, 4 is a lagged endogenous variable. 
The specialization (2.12) is equivalent to the statement that the sample of 
exogenous variables has been orthogonalized. The xx are the orthogonalized 
forms. The r% are supposed to have been adjusted conformably with such an 
orthogonal transformation. We assume that the disturbances en, e2 are nor- 
mally distributed. We denote by pu, ---, pis, Pu, ***, Po the unrestricted 
maximum likelihood estimates of mu, - - - , 4, 72, * * * , To. Under the condi- 
tions assumed, the px are distributed according to 


f(pu, +++, pealdpiu + - + dpa 


l 
= —— exp [—3{ (pu — mi)? + --- + (pu m2s)?} ]dpur ++ +dpoy. (2.13) 
(2x)* 
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From (2.13) and the transformations (3.5)—(3.12) and (4.3)-(4.10) that fol- 
low, we shall derive the joint frequency functions of the GCL estimates of 
equation (2.1) in the following cases: 

Case I. yn =0, yi2=0. Of the K(=4) exogenous variables appearing in both 
structural equations. K.(=2) are excluded from the equation (2.1) to be esti- 
mated. There are G(=2) endogenous variables appearing in equation (2.1). 
Therefore, Kz—G+1=1. Necessary conditions for over-identifiability are met, 
[26, pp. 156-7]. In this ease the GCL estimates of Sy», 713, yu possess finite 
means but do not possess finite variances. 

Case IT. yn =0, ¥2=0, ¥13=0. Here K2=3, G=2, so that Ke—G+1=2. In 
this case the GCL estimates possess finite variances but no higher moments. 

These results can be demonstrated by means of the exact marginal frequency 
functions of GCL estimates of 8,2. alone. The results are independent of the 
sample size, N. For Case I we shall also present the sub-case in which 6,2.=0, 
as well as the general sub-case in which 6,240. For Case II we shall present 
only the sub-case in which §:.=0, as this will be sufficient to demonstrate the 
existence of a finite variance. 

In following the derivations in Sec. 3 and Sec. 4 below the reader will note 
that the assumptions (2.9) about the variances and covariance of the disturb- 
ances nu, M2, and the assumptions (2.11) and (2.12) about the moments of 
exogenous variables are of no consequence except for minor parameter changes 
in the exact distributions arrived at. For instance, generalization of the fre- 
quency functions (3.48) and (4.14) to the case in which o1. +0 is straightforward. 
Moreover, an important implication of the orthogonalization of exogenous 
variables is that 

ot -+- me > 0, N—>», (2.14) 
provided that at least one of the original, unadjusted reduced-form coefficients 
12, Te is not equal to zero. (See fn. 3 below.) 


3. DERIVATION OF FREQUENCY FUNCTION. CASE I 


In the application of GCL theory to equation (2.1) the statistics of chief 
interest are the estimates of coefficients f12, y:3, ya and the identifiability test 
statistic, [4, Sec. 5; 6]. The GCL method resoives the sample information con- 
tained in the estimates pu, ---, Pus, Pa, ** * , Pes, into two parts: that which 
is to be used in the estimation of S12, 713, Yu, and that which is contained in the 
numerator of the identifiability test statistic. We therefore express these GCL 
statistics as functions of the reduced-form estimates pu, - + - , Ps} Por, * * * » Do 

We have 


Piurpe21 + Pi2P22 : 


pai + Pox 


= (pir — Vipar)? + (pie — v1p22)?; 
= Piz — Yipes; 


U4 = Pia — VIP; 
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where v; is the GCL estimate of 612; v3 is essentially the numerator of the identi- 
fiability test statistic; vs is the GCL estimate of 713; vs is the GCL estimate of 
vu. The essential feature of the GCL method is the minimization of the quad- 
ratic form (3.2) with respect to x; [3; 4, Sec. 3; 5; 6]. We thus obtain the form 
(3.1) by setting equal to zero the partial derivative of v} with respect to v; and 
solving for »,. We obtain the GCL estimates v3, v4, by making the appropriate 
substitution in equations (3.3) and (3.4). 
Formulas (3.1)—(3.4) suggest the following transformation: 
Pi = rv; cos 8 — ve sin 8 (3.5) 
= rv, sin 8 + v2 cos 0 (3.6) 
=r cos 0 (3.7) 
r sin 6 (3.8) 
= V3 + VWs (3.9) 
Ve + Vie (3.10) 
= U5 (3.11) 
V6. (3.12) 
We note that 
Pir COS 8 + Pie sin 0 


v 1 — — 


r 


by means of which (3.1) can be deduced from (3.7)—(3.8). 
We also note that 


Pu — Vipa = — v2 sind 

Piz — Vipe2 = Vv’2COS 0, 
from which (2.2) can be deduced. It is also not difficult to show that v2 can be 
expressed by 


V2 = PiP22 — Pi2P21, 


Ve = — PiupPo22 + Pi2P21, 


under the special assumption (2.12). 
The new variables are defined over the domains 


0<r<s-a, 

0 <6 < 2z, 
Su<+o, 
Sul+, 
SuS+, 
SusSst+o, 
S%S+ 2, 


eS t+ ~, 
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The Jacobian of the transformation defined by (3.5)—(3.12) is 


(pu, “++, Dos) 


—_—___ = ¥ 


5(v1, v2, r, 6, ieee" 


2 


(3.13) 


Since the new variables 7, r, @ do not appear in connection with pis, piu, pos 
and px in (3.9)-(3.12), and since the new variables vs, v4, vs, v6 do not appear in 
connection with pu, pis, Pa, Px» in (3.5)—(3.9) it is natural and convenient to ex- 
press the probability density function (2.13) as the product of the two inde- 
pendent factors as in (3.14). 


l 
S(pu, +++, Pea)dpu:- + dpa = (Qn)? P [—4Q:]dpudpisdpadpo 
w)* 


oat (3.14) 
Xx an)? exp [—3Q2]dpisdpidposdpog 
where 
Q: = (pu — wu)? + (pi2 — wiz)? + (por — 21)? + (paz — m2) 
and 
Qe = (pis — mis)? + (is — 1a)? + (p23 — 123)? + (pos — 24)?, (3.16) 


before making the substitutions (3.5) through (3.12). 
Upon making the substitutions (3.5) through (3.8) in (3.15) we obtain 


Q: = (1 + 0)*r? — 2[r(riw + m2) + Vom 12] cos 6 


= 2[r (a1 + 422) — vows] sin 6 + v2 
2 2 2 2 
+ mir + 12 + 121 + Tee. 


We suppose that Si.=8. Since mu = Bra, 112= B22, we let 


A? = (1 + B*)(w21 + 22), (3.18) 
Z, = Tr2i(Bv1 + 1) + ro2Bre, (3.19) 
Z2 = — rr22(Bv1 + 1) + warBre. (3.20) 


Making the substitutions (3.18) and (3.19) in (3.15) and using 


, T 
sin @ = cos{ 60 — —}, 
2 
Tv 


Q,= (1+ vi)r? + vs — 2Z, cos @ + 2Z2 cos (6 — =). 


we obtain 


9 


The transformation of Q2 is somewhat more tedious: we first make the sub- 
stitutions (3.9) through (3.12) into (3.16); then we complete squares on v; and 
vs. Furthermore, we find it convenient to make the following additional trans- 
formation: 
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Xs 013 — F101 — Te3 


V5 = 


Vite) 4%) 





ae 


in Qs, and, consequently 


/ ( 1 = + v2) — 
dX, 
Vat) ; 


Finally we have 


O (v3 — 13 + V1%23)* + (v4 — Tut V1%24)? 


* 


Using (2.14), (3.8) and (3.27) in (3.14) we obtain 


g(V4, Ve, 7, 9, Vs, Vs, X5, Xe)AX AX sdvadvsdOdvedrdv, 


2 2 
1 2 l+n. Ve 
=z —_— 4 exp _ 4 _— —— --—-— 


9 


_ ~ = 


(27) 


T 
“exp |Z: cos 6 — Z2cos (6 — =) r°dédv.drdv,; 


l | 1 {es — 413 + Vym23)? + (V4 — Tin + nme 
x — — exp -< —~—— — ea pet “a aa atm dv3dv4 
2r(1 + vj) . 1+ 0; 
1 
X = exp [—4{ Xs+ Xe} ]dX dX. (3.28) 
rg 


The exact joint frequency function h(1, v3, v4) we are seeking is the coefficient 
of dv,dvsdy, in the expression that we obtain by integrating the right-hand 
member of (3.28) with respect to dX, dX5, d0, dv, and dr over the corresponding 
ranges of integration. The form of (3.28) indicates, however, that the joint 
probability density function of 1, v, vs is of the form 

h(v,, V3, v4)dv,dv3dv4 = hi(vihe(ry Ve, v3)dv,dv3dv,4 
nu igen (3.29) 
= hi(vi doi f(X 3, X4)dX gd X 4y 
where 


= X¥;/1+ vt — V1%23 + 713, 


= XivV1 + vi — Uym4g + 1 14. 
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a(Xs, X.) 
h v ; Ve, v = — ty 
2( 1 2 3) : Qn(1 + 0?) 
ve |-; (os rut orn) + (> eu tot 1) — 
- 1+ 





1 : 
f(X2, X.) = —exp[-4{Xs+ X4} ]. 
at 


Thus we see at once that the statistics 


i. oe (mis pe 4723) 


X3 a 
V(1 + v3) 





(3.35) 


i estat an (3.36) 
V(1 + v) 





are mutually independent standard normal variates and are not dependent on 
v;. This result, however, is easily deduced directly from equations (3.3)—(3.4). 
Consequently it is apparent that we require only to find the function hi(™) 
in (3.29). 

Making the substitutions (3.30) through (3.32) in (3.28) and integrating the 
latter with respect to dX, dX5, dX4, dX; over— ~ <X;,<+%, we obtain 


' ee (1 + v3) v 
9i(v1, V2, 7, 9)dbdvedrdv, = > exp |- een 9m - 4 


9 9 
_ _ 


av 


[ ) (3.37) 
exp |Z, cos 6 — Z,cos (6 — 2/2 
x ve he nto ( : 2) r°ddvedrdv. 





2r 


Integrating (3.37) with respect to dé over 0S@S2z7, we obtain 


P 2 2 

e~ /2 (1 ob v1) =] 
-exp| — —— 2 — |r? 3.38 
exp| r r 3 r ( ) 


-Iof VZi + Z3} dvedrdv,, 


92(01, V2, r)dvedrdv, = 
7 


where J (Z) denotes the modified Bessel function of the first kind of zero order, 
and Z;, Z, are defined as in (3.19) and (3.20). 

Recall from (3.19) and (3.20) that Z, and Z, are functions of both v, and r. 
Three considerations, namely, the algebraic forms of the estimated sampling 
variance of », [ef also 4 formula (3.15), p. 102], the necessity for integrating 
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(3.38) over v2 and r independently, and the form of the Generalized Addition 
Formula for Io (Z), [13, pp. 128-9, p. 95] lead us to make the following trans- 
formation: 


bh=v 1 + T (Bv; + 1) (3.39) 


bo = Vr + 1h B. (3.40) 


from which it follows that 


Zi + Zy = by? + bys (3.41) 
consequently 


Iyi(VZ2 + Z) = Io(V/b?r? + 022) 


= Io(bir)Io(bwe) + 2 Do (-1)"Zan(bir) In(bav2) 


n=1 


by the Generalized Addition Formuia for I(Z). 
Substituting (3.42) into (3.38) we obtain 


*/2 (1 + 0))r? 
— Jexp |- 2 | To(bir)re 04 2T(bove) 


2T 


9 


- (1 + v))r? i 
+2 > (—1)" exp |- ~ ; -| Ton(byr)r? +e”? *Taa(bars)¢ dveodrdv. 


n=1 ad 


We have now to integrate (3.43) with respect to dr over 0<r< «, and with 
respect to dv, over — © <v2< 4+. 

Since the Bessel functions I2,(by2), n=0, 1, 2, - - - , contain even powers of 
ve only, we have 


2 
+2 e = be 
es 2Ton(bev2)dve = V2r eh /4], ( ). (3.44) 


4 


—2 


Letting 


we obtain 


= (1 L y,)r? 
| eapi-- = - To,(byr)r2dr 


V20(n + (GX | 3 Xi 
= ——— i 31 8 +f - ; n+ 1; =*) 
(1 + v2) (2n + 1) 2 2 


(3.45) 


by Hankel’s generalization of Weber’s first exponential integral, [28, pp. 393- 
394], where 1F;(a; p; 2) denotes the confluent hypergeometric function, [29, 
pp. 337-51]. Upon integrating (3.43) over —» <m<+o, 0<r<+o, we 
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obtain by using (3.44) and (3.45), the exact finite sample frequency function, 
hy(v;) or v;: 


end? /2+ 03/4 f 3 : xX, 
vio Sarat OC) a(S 0 8) 
Vr(i + 924 \2 4 2 2 


3\ /Xi\" 
: r(n ‘o fe 3 Xi 
e 441 
A ad a I ats 2B 3 — ;2n+1;—)$; (3.46 
+ 2 | 1)" r'(2n + 1) TS ad >) ( ) 


—-“ox<wy< + @, 


Consequently, the exact joint frequency function is 


h(v1, vs, V4) = Ay(vi)he(v1, v2, vs) = hia(vs) onl +) : 
(3.47) 


{ 1 E — 113 + 0123)? + (Vs — 1g + me} 
“exp —_- . 
2 1+ vj 





The function he(v:, v3, v4) is the conditional frequency function for v3, v4, 
given v;. When it has been shown that 1, the GCL estimator of 812 in (2.1), does 
not possess a finite variance, it will follow that v3, and v4, the GCL estimators 
of yi; and yy in (2.1), also do not possess finite variances. 

To show that the frequency function (3.46) possesses a finite first moment, 
but not a finite second moment, we require only to examine the first term in 
the following series expansion: 


hi(o:) = —=——___ (3.48) 
J/ 7 


where 


To (3 
4 


3 
A,» =T - - - 1)* PRT 3.49 
(m + 9 ) m ln ! " j (m ra 3) '(m a s) ! ( ) 


It will be convenient, however, to introduce here the special sub-case in 
which 6.=0. If B2=0, then b:=0, A°=b|=73,+7%, and the frequency fune- 
tion (3.46) reduces directly to 


Mi(v1) = ber a8 at) 


aT esp pe emer bi 


2(1 + v2) ( eat s!s! L2(1 + v3) 
-— & <vu< t+ 
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Using 
a dv, / St 
f ~ ne -++, (3.51) 
Jig (1+)*t*?? (3/2 + 8) 
we find, upon integrating hf (v;) in formula (3.50) that 
+0 | 
hi(v;)dv, = |, (3.52) 
-2 
We are thereby satisfied that h}(v:) is indeed a frequency function. Now using 
7 v,dv, 
f —_——_——  =(0 s=0,1,2,-- 
we (1 + 0,2)*+8/2 


we obtain 
+2 
+* 
f vyhy(v,)dv, = (). 
2 


We see that the frequency function At (v1) possesses a finite mean. Finally, 
using 


2 
- vydv, «/2 
= (sec a — cosa)da = + ~, 
Y r 


, a+r 


/2 


we obtain 


+2 2.¢ 
f vyh; (v;)dv; = -{- eo, 


—oe 


Thus we see that the frequency function hf (v,) of the GCL estimator 
possesses a finite mean, but not a finite variance. It is a simple matter to show 
that the mode, m, of the frequency function Af (v;) is located at the origin 
v,=0. 

The frequency function hf (v;) with ¥=4.0, Bi2=0, is illustrated graphically 
in Figure 1, where it is also compared with the normal frequency function 
@ (0, 1/A*); this comparison illustrates very clearly the inadequacy of the 
mean square deviation as a measure of efficiency.* 





* We can, of course, try to approximate the frequency function h?(v1) by a frequency function that does possess 
a finite variance, e.g., Pearson's Type VII in this case. Dr. Ernest J. Mosbaek has tabulated the more general 
form of frequency function (3.48) for 81:0, [7, pp. 42-3}. The latter is skew, with the property that the mode lies 
between the origin, v1: =0, and the exact parameter point, vi =812. Pearson’s Type IV is a plausible approximation 
to the frequency function (3.48). In the more general case involving three equations treated in the sampling ex- 
periment (7, Sec. 3, 4], Pearson's Type IV emerges as a suitable approximation to each of the GCL estimator dis- 
tribution functions examined. 

Recall formula (2.14). Note that as the sample size increases indefinitely, i.e., as N+ «, we have 


bs 
u-—-~ ow, 


and  ¥V(v1—8) asymptotically unbiased and normally distributed, [4, pp. 103-5]. Moreover, for any fixed N, 
the distribution of v: will tend to be approximately normal and unbiased whenever \? — (63/2) is large, for whatever 
reason, [7, pp. 50-7]. 
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4. DERIVATION OF FREQUENCY FUNCTION. CASE II 
We shall now consider the exact finite sample frequency function of v, the 
GCL estimator of 82, when the identifiability hypothesis is 


Ho: yu = 0, y2=0, vis = 0. (4.1) 


This example is of importance, for it provides the first case in which the GCL 
estimator possesses a finite variance. Moreover, when we consider the special 
form in which Bj.=0, and in which sufficient conditions for identifiability are not 
met, namely, where 

T21 = T22 = 23 = 0 (4.2) 
in (2.6), we encounter the first case in which the sampling variance of a GCL 
estimate does not tend to infinity when sufficient conditions for identifiability 
are not met, [ef. 17 p. 175]. The existence of finite moments of the GCL 


coefficient estimators is not connected with the sufficiency of identifying restric- 
tions. 


It will serve our immediate purpose, which is to forestall premature gen- 
eralization of the propositions about the non-existence of finite variances of 
GCL estimators, to consider only the special case just mentioned; namely, 
where the equations (4.2) hold and £:.=0. Of course we do not assume that the 
statistician has this information. 

The GCL coefficients 1, v4, and statistics v2, vs are related to the unrestricted 
reduced-form estimates pu, «+ + , Po by the transformation: 

= vir sin 6 cos @ + v2 cos cos @ — v3 sin d (4.3) 
vir sin 6 sin @ + v2 cos 6 sin d — v3 cos (4.4) 
= vir cos 8 — ve sin 0 (4.5) 
V4 + VWs (4.6) 
r sin 6 cos @ (4.7) 
r sin 6 sin @ (4.8) 
r cos 0 (4.9) 
= Us (4. 10) 
S¢ <> a, 
0<¢ < 2r, (4.11) 
0<0< =; 
where v; is the GCL estimator of By. in (2.1) and »% is the GCL estimator of 
yu in (2.1); and 


2 2 
ve + 03 = (pu — Viper)? + (Piz — V1D02)? + (pis — vipes)? (4.12) 


is essentially the numerator of the GCL identifiability test statistic. The Jacob- 
ian of (4.3) is 


J =r' sin 0. (4.13) 








63 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 
Using 82=0, and (4.2), we transform the probability density function (2.13) 
according to (4.3) and (4.13); integrating with respect to v5, v4, v3, v2, %, 7, 9, 
we obtain the exact finite sample marginal frequency function of 7: 


— © <5 < + @. (4.14) 


The frequency function (4.14) possesses a finite mean 


* 2v;dv; 
M1 = oe ay ape 
r(1 + v3)? 


=e 


2 2v,dv, 
Me = a ar a “Flies 
(1 + 0)? 


and a finite variance 


—2o 


The frequency function (4.14) possesses no finite moments of higher order than 
two. 

The results (4.16) taken together with the results (3.49) and its consequences 
suggest the conjecture that the exact finite sample distribution functions of 
GCL coefficient estimators possess finite moments of order y= Kz—G+1. The 
proof of this conjecture is beyond the scope of the present study. 

The results of Sections 3 and 4 continue to hold if the normally distributed 
disturbances en and é in equations (2.1) --(2.2) are serially correlated, pro- 
vided, of course, that no lagged endogenous variables appear in the structural 
equations, and that the sample of exogenous variables 2, ---, 24 remains 
fixed. In this case, the estimates pu, - ++, pu are normally distributed and 
unbiased; their variances, however, depend upon the correlograms of na, 
ne, t=1, 2,---+, N, and upon the sample serial covariances of exogenous 
variables. The precision with which the exact distribution functions of GCL 
estimates can be approximated by the normal form, or by some other con- 
venient distribution functions, is, of course, affected by the correlograms and 
the sample serial covariances of exogenous variables. 


5. CONCLUSION 


While it is true that the forms of the exact frequency functions (3.46) and 
(4.14) depend on the normality assumptions made in respect of the disturb- 
ances €, and é, in the structural equations (2.1) — (2.2), and on the assumption 
that no x» is a lagged endogenous variable, still it is plausible that the GCL 
estimator v; in Section 3 does not possess a finite variance even when the fore- 
going assumptions are relaxed. 

The results derived in Section 3 and Section 4 provide a cogent argument 
against the uncritical practice of casting away the so-called “outliers,” i.e., 
the extreme deviations encountered in Monte Carlo studies of estimator dis- 
tributions, especially in those cases in which the number K;2 of predetermined 
variables excluded from a structural equation is exactly equal to the number, 
G, of endogenous variables present, ie., Kz-G+1=1, [22], and a fortiori in 
those cases for which K.—G+1=0 [7, p. 15]. 
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It is clearly not possible to make any meaningful comparison between the 
GCL estimator », in Section 3 and the corresponding LVR, DLS, and FI 
estimators of 812 in respect of putative mean-square deviations. The same is true 
in respect of the three-equation case (@=3) corresponding to case I in Section 
3, i.e., in which Ke, the number of exogenous variables hypothetically excluded 
from the structural equation is K2=3, so that Kz—G+1=1.4 When the exact 
finite sample frequency functions of the corresponding LVR, DLS, and FI 
estimators have been worked out and are available, and when some appropriate 
measure of dispersion has been invented, it will then be possible to undertake 
some sort of meaningful normative comparisons among these estimator types. 
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DISTRIBUTIONS OF CORRELATION COEFFICIENTS 
IN ECONOMIC TIME SERIES 


Epwarp AMES AND STANLEY ReIrEeR* 
Purdue University 


This paper presents results, mainly in tabular form, of a sampling ex- 
periment in which 100 economic time series 25 years long were drawn 
at random from the Historical Statistics for the United States. Sampling 
distributions of coefficients of correlation and autocorrelation were 
computed using these series, and their logarithms, with and without 
correction for linear trend. We find that the frequency distribution of 
autocorrelation coefficients has the following properties: 


(a) It is roughly invariant under logarithmic transformation of data. 

(b) It is approximated by a Pearson Type XII function. 

(c) It approaches a rectangular distribution symmetric about 0 as 
the lag increases. 


The autocorrelation properties observed are not to be explained by 
linear trends alone. 

Correlations and lagged cross-correlations are quite high for all 
classes of data. E.g., given a randomly selected series, it is possible to 
find, by random drawing, another series which explains at least 50 per 
cent of the variances of the first one, in from 2 to 6 random trials, de- 
pending on the class of data involved. The sampling distributions ob- 
tained provide a basis for tests of significance of correlations of eco- 
nomic time series. We also find that our economic series are well de- 
scribed by exact linear difference equations of low order. 


1. INTRODUCTION 


INCE 1913, when H. L. Moore first undertook to measure the elasticity of 

demand for agricultural commodities, economists have been trying to ex- 
tract information about economic relationships from time series data. Since 
that time, a variety of statistical models have been employed to analyze time 
series data. Despite the considerable sophistication of the available statistical 
models of time series, we cannot yet claim to know the class of statistical models 
appropriate to economic time series. An accumulation of systematic informa- 
tion about statistical properties of economic data would improve our ability to 
decide the relevance of alternative statistical models, and suggest the lines 
of development of new models, if that should seem desirable. This paper is a 
step toward systematizing information regarding statistical properties of ob- 
served economic time series. 

This paper presents frequency distributions of coefficients of correlation and 
autocorrelation computed from annual economic data. We chose the correla- 
tion properties of economic time series for our first step, because of the rele- 
vance of these statistics for current theoretical and empirical work. Regression 
models are perhaps those most widely used for estimating economic relation- 
ships. In these models, the correlation coefficients obtained indicate the amount 
of association observed. The significance of such association is usually evalu- 





* This work has been in part supported by the Office of Naval Research. 
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ated by computing the probability of a correlation as large or larger than the 


observed correlation on the hypothesis of no more than chance association. The 
significance test appropriate for such applications should be based on the prob- 
ability distribution of correlations among randomly selected economic variables 
rather than on the distribution of correlation coefficients among variables from 
another uncorrelated population. Thus, the frequency distributions of correla- 
tion coefficients presented in this paper provide a more appropriate basis for 
tests of significance of correlations among economic variables than does the 
distribution of the correlation coefficient under the usual nul! hypothesis of 
statistical theory that the expected correlation is zero. 

It is known that estimates of the coefficients in linear autoregressive models 
may be obtained by correlation techniques; and that properties of such models 
may be formulated in terms of the roots of their characteristic polynomial. 
Hence, a knowledge of the empirical distribution of autocorrelation coeffi- 
cients points the way to a study of autoregressive structures which represent 
economic processes. In this sense, our sytematization of this knowledge could 
assist ultimately in the selection of stochastic models suitable to describe eco- 


nomic processes. 


2. DESCRIPTION OF THE DATA USED 
We took a random sample of 100 economic time series from the //istorical 
Statistics of the United States,’ using the RAND random digits to select them. 
The volume contains 2,994 columns of data, of which roughly half are annual 
series covering the period 1929-1945. These we extended through 1953, using 
the annual Statistical Abstract. We thus accepted the definition of “economic 
data” used by the Census Bureau. This hundred series for the 25-year period 
1929-1953 form the raw material for our work. 
Each series was then considered in four possible representations: 
1. In the units originally measured, called natural numbers; 
2. As residuals about a linear regression function z=a-+bt, where t, the in- 
tegers 1, - - - , 25, stands for “time,” i.e., “corrected for trend”; 
3. As logarithms;? and 
. As residuals about a linear regression function log;y) z=a+bt, where ¢ is 
as before, i.e., “logarithms, corrected for trend.” 


The use of “trend-corrected data” requires comment. There is a view in the 
economics profession that economic processes may be additively separable into 
“long-term” processes, which depend upon slowly-changing factors such as 
population, technology, resource endowment, etc., and “short-term” processes, 
including primarily the factors loosely describable as “the business cycle.”* A 
part of the economic literature associates discussion of “the business cycle” 
with trend-corrected empirical data, and for this reason, we have felt it would 
be instructive to include “trend-corrected” data in this study. 








1 U. S. Bureau of the Census, Washington, 1946. 

2 Two of the series chosen contained negative numbers, and had to be replaced by other series for the third 
and fourth representations. 

* This problem is discussed by Edward Ames, “A Theoretical and Statistical Dilemma,” Econometrica, 1948. 
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3. DESCRIPTION OF THE CALCULATIONS 
Each of these representations was then processed as follows: 


The coefficient of autocorrelation was computed for each series, for lags 
of 1, 2, 3, 4, 5 years. 

Successive pairs of series were correlated (in the random order of selec- 
tion); the second series in each pair was treated as the independent vari- 
able in a linear regression equation, and lagged 0, 1, 2, 3, 4, 5, years be- 
hind the first (dependent) variable. 

Since these various coefficients involved different degrees of freedom, all 
of the coefficients were made comparable by means of the transformation 


— ) 7/2 


n saad 
ee a er 
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7 is the corrected correlation coefficient; 

r is the uncorrected correlation coefficient; 

n=25—L, where L is the lag, and 25 the number of years for which 
data were collected. 


where 


In the process of deriving the trend-corrected data, correlation coeffi- 
cients of the natural numbers and the logarithms with “time” (the in- 
tegers 1, 2, - - - , 25) were computed. 

Frequency distributions of the corrected coefficients of correlation and 
autocorrelation were prepared, for each lag and each representation of 
the data. 

The mean; the second, third, and fourth moments about the mean; and 
the coefficients 8, and 8:4 were computed for each lag and each represen- 
tation of the data. These were computed from the individual observa- 
tions and not from the frequency distributions, so that no corrections for 
grouping were required. 


4. SUMMARY OF THE FINDINGS 


The frequency distributions resulting from our calculations, and moments of 


these distributions, are given in the Appendix. We may summarize these tables 


as 


follows: 


4.1 Autocorrelation is clearly very great in unadjusted economic series (both 


natural numbers and logarithms). Some 70 per cent of the variance of such 
data is “accounted for” on the average, if we correlate any series with 
itself, with lag of one year (Tables A-I, A-IIi). The same series, correlated 
with the integers 1, 2, - - - , 25 give U-shaped distributions of correlation 
coefficients, with modes around +.90 and —.70 (Table B-I). Although the 
average coefficient of autocorrelation declines as the lag increases, it is still 
.48 for a lag of 5 years. A series having autocorrelation close to +1 for lag 
of one year will be approximated closely by a straight line. Likewise, a series 
correlated with “time” will tend to increase by a constant amount each 





‘If uj is the ith moment about the mean, 8: =g"/uo? and Bs =pguo*. 
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year, i.e., its value in any year will roughly equal the value of the preceding 
year plus a constant. In this sense, trend and autocorrelation analysis 
measure the same thing. 


To determine other properties of this distribution, we made use of the coeffi- 
cients 8; and $2 and the Pearson system of curve-fitting.’ Both 8; and B, de- 
crease as the lag increases. Their values are (a) rather similar for the distribu- 
tion of the natural numbers and of the logarithms; and (b) with one exception,® 
all satisfy the condition 


1.1258; + 1.5 < Bz < 1.2508; + 2.0. 


Distributions meeting this requirement are of a simple, “Type XII” form, 





Pree © 1 Ee = 
atta br(1 + m)r(1 — m) Lb —| 


where the constants, a, b, and m are calculated from the moments. In particu- 
lar, when d=0, this distribution is the rectangular distribution; if 6; =0, then 
m=0; and it will be noted that, within our range of observation, 6, declines 
from 5.2 to .3 for the logarithms, and from 3.9 to 1.4 for the natural numbers. 
Calculated values of the functions are given in Table 1. 

Thus, our calculations indicate that the autocorrelation properties of eco- 
nomic time series may be described by a frequency distribution which 


(i) is roughly invariant when the data are subjected to a logarithmic trans- 
formation; 
(ii) is approximated by a Pearson Type XII function; and 
(iii) approaches a rectangular distribution with mean 0, as the lag increases. 


Economic series, whether in natural numbers or logarithms, have autocorre- 
lation properties more complicated than can be explained by the simple linear 
trend model. If, apart from “trend,” there were no tendency for an economic 
series to be “influenced by its own past,” the trend-corrected data would show 
small autocorrelations. Actually, even these show average autocorrelations of 
.65 and .68 for lag 1. Moreover, although these averages decline rapidly toward 
zero, the possibility is not excluded that, over a range of lags longer than 5, the 
average might be a periodic function of the lag (perhaps damped), as is the 
ease with some autocorrelation functions observable in the natural sciences. 

A large body of empirical work by the National Bureau of Economic Re- 
search has pointed to a persistent short-term fluctuation three to four years in 
length, on the average, for most economic series. Such a fluctuation does not 
appear to affect our autocorrelation coefficients, since nothing like a corre- 
sponding period appears in them. If the average autocorrelation does have a 
period, its length would appear to be substantially greater than the period 
which most writers have ascribed to business fluctuations, 

The distributions obtained for the trend-corrected data have fairly constant 





§ William P. Elderton, Frequency Curves and Correlation, Fourth Edition, Washington, D. C. 1953, provides 
a useful collection of methods and criteria. 
* Natural numbers, lag 1, fail by a little to meet this requirement. 
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TABLE 1. FITTED FREQUENCY FUNCTIONS OF COEFFICIENTS OF 
AUTOCORRELATION IN 100 RANDOMLY SELECTED 
ECONOMIC SERIES* 








Autocorrelation in Series** Expressed as 





Lag in Years . 
a Natural Numbers Logarithms 








1 7.091 + r7-7% 1 7.410 + r7-77 
885 L.968 — "| 1.362L.962 — “| 

1 7.287 + rq 1 7.389 + rq 
1.071 L.963 — “| 1.062 L.947 — “| 


1 [<.. + "|" 1 [== + |" 
1.148 L.966 —r 1.046L.970 —r 


1 [sa + "|" 1 .581 + - 
1.211 

















1.230L.963 —r 
1 7.615 + 4 355 1 7.550 + 4% 
1.023 L.959 —r 1.214L.948 —r 


955 —r 

















* The constants in these functions give bounds to the values of r for which non-zero frequencies exist. Thus, 
for natural numbers, lag 1, —.091 <r <.968. The calculations follow Elderton, op. cit., p. 111. 
** Not corrected for trend. 


variance for all lags, and (for lags greater than 1) are almost symmetrical. 
They do not seem well approximated by the normal distribution, since A, 
(which in the case of the normal distribution equals 3) is as low as 1.76 (roughly 


a rectangular distribution) and as high as 4.8, with no great concentration of 
values near 3.0. It does not appear reasonable to consider that the mean of the 
values of any moment for the various lags has any particular meaning, al- 
though if a moment fell in a particular small range for all lags, it might lead us 
to explore this element of similarity in the distributions. No great stability in 
8: can be indicated, however, and we cannot now say much about the distribu- 
tion of autocorrelation coefficients in trend-corrected economic data, except that 


(i) The autocorrelations from the trend-corrected data have distributions 
unlike those from uncorrected data. The distributions obtained from the 
trend-corrected natural numbers and from trend-corrected logarithms 
appear to be rather similar. 

(ii) For lag 1, the mean is substantially above zero, but declines rapidly to 
zero as the lag increases. The mean autocorrelation might be a periodic 
function of the lag, but we have not computed enough lags to be sure. 

(iii) The variance of the distributions is roughly constant, and (for lags 
greater than 1) the distributions are almost symmetrical. 

(iv) Other features of the distribution do not seem to us ts be very regular. 


4.2 Serial correlation coefficients, like autocorrelation coefficients, are distributed 
in roughly the same way for logarithms as for natural numbers. Like the 
distributions of autocorrelation coefficients, they differ according to 
whether original data or deviations from trend are considered. All dis- 
tributions are close to symmetrical: in no case is 6, over .167. Moreover, 
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all the means are small: for the uncorrected data, the means are between 
.10 and .16, and for the corrected data, they are in most cases between +.1 
and in no case as high as .16. Variances tend to be smaller for larger lags 
(in autocorrelations, they were constant or increased with the lag), and are 
several times as large as the autocorrelation variances for the same lag. 


Trend-correction makes for one important difference in the serial correlation 
distributions. In the uncorrected data, 82 is almost always less than 1.8;’ in the 
corrected data, 62 ranges from 1.9 to 3.2. It will be recalled that the Pearson dis- 
tribution corresponding to 8, =0, 82= 1.8 is rectangular. In symmetrical (8; = 0) 
Pearson distributions, those for which 6; <1.8 are bimodal, and those for which 
8,> 1.8 are unimodal. Thus, it would be correct to say that for the uncorrected 
data, the serial correlation coefficient is almost rectangularly distributed for all 
lags; but where it deviates from the rectangular, it is in the direction of bimo- 
dality rather than normality. 

On the other hand, serial correlation coefficients from corrected data tend to 
have a unimodal distribution. If 82 were symmetrically distributed about 3.0, 
one might be tempted to use the normal distribution as an approximation to 
these distributions. Only in one case out of twelve, however, does 2 exceed 3.0, 
and in nine, 62 is below 2.6. This means that, within any given interval about 
the mean (expressed in multiples of the standard deviation), there would be 
fewer observations in these distributions than in a normal distribution with the 
same standard deviation. Thus, a significance test, assuming a normal distribu- 
tion whose mean and variance were given by the present estimates, would lead 
to overestimation of the significance of any serial correlation coefficient ac- 
tually obtained. 

Two tentative experiments were carried out on the serial correlation coeffi- 
cients in an early stage of our work. These are reported here despite their in- 
exactness,* because the gross findings are of some interest. Suppose that it is 
posited, on theoretical grounds, that variable x, may influence 2, and it is 
considered possible that the effect of x; may be felt after some delay. We might 
calculate serial correlation coefficients for these variables for lags 0, 1, 2, etc., 
years. Having selected the lag for which the coefficient is maximized, we obtain 
(a) an estimate of the closeness of the relationship, and (b) an extimate of the 
speed with which the independent variable affects the dependent variable. 
This sort of procedure is, in fact, sometimes followed ia empirical work. 

The results we obtained by similar operations on our randomly selected data 
are given in Tables D-I, D-II, and D-III. Two different hypotheses were used: 
(1) The cause must not succeed its effect; and (2) the cause must strictly pre- 
cede its effect. For the first, we selected the maximum correlation coefficient 
between each pair of variables over lags 0, 1, 2, 3, 4, 5; for the second, the maxi- 
mum over lags 1, 2, 3, 4, 5. Thus, for the first hypothesis, we note that for the 
natural numbers in 46 per cent of all cases (| r| >.7), it was possible to select a 
lag such that the “independent variable” would account for half or more of the 





1 It is 1.87 for the logarithms, lag 5. 

* They used correlation coefficients which were not corrected for changes in the number of observations. Thus, 
two correlations of .5, one for lag 0 and one for lag 5 in the independent variables, were treated as equal, although 
the latter is actually smaller, being based on 20 rather than 25 observations. These experiments were not repeated 
after the correction, since the computer program used did not print out the data needed. 
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variance in the “dependent variable” ; for the logarithms, we could do as well in 
60 per cent of all cases. 

In fact, if we use data not corrected for trend, on any of the following hy- 
potheses: 


(1) the independent variable has an immediate effect; 

(2) the independent variable has an effect within 5 years; 

(3) the independent variable has a delayed effect, but the delay is of not 
more than 5 years; 


then we can, on the average, find an independent variable capable of “explain- 

ing” 50 per cent or more of the variance in any randomly selected economic se- 

ries in three random trials or less (‘Table D-IV).* If we used data corrected for 

trend, it may take as long, on the average, as twelve tries; but if we omit the 

case with zero lag, we can succeed even here in four to six tries, on the average. 
5. CONCLUSIONS 

Our results do not yet provide very clear specifications for delimiting the class 
of statistical models appropriate for analysis of annual economic time series. 
They do, however, permit some comments about earlier work. In particular, 
we shall discuss some of the economic literature on linear stochastic difference 
equations. 

Tinbergen,'® Orcutt,'! and Gartaganis” have made studies anticipating some 
of our findings. Like us, they have studied autocorrelation, but whereas we 
selected economic series at random, the first two writers used series suggested 
by the theory of business cycles, and the third used production series from 
Arthur Burns’ classic study Production Trends. Thus, a portion of the differ- 
ences between our results and theirs may result from their selection techniques, 
which prefer only a sub-population of economic series. 

Second, all three writers have studied a short period (the first two studied 
1919-32; the third, 1914-29), and the third studied a non-overlapping long 
period (1870-1913). The longer period shows quite different autocorrelation 
properties from the shorter periods, but it is not clear whether the differences 
result from the difference in the length of the period for which autocorrelations 
were studied, or whether they are due to the fact that non-overlapping periods 
have different autocorrelations. The evidence suggests that autocorrelation 
(for any lag) would increase with tke number of observations, but we have not 
tested this conjecture. 

Tinbergen’s original contribution was to prepare a business cycle model in- 
volving a number of variables of theoretical interest. He showed that he could 
compute multiple regression curves which would closely approximate the time 
paths of the individual variables in the system (for the period 1919-32), and 





® For example: 46 per cent of the pairs of variables (natural numbers not corrected for trend) showed a cor- 
relation with absolute value at least .7 for at least one lag in the set 0-5 (Table D-I). Thus, on the average, 1/.46, 
or slightly over two random tries, would be required to find as good a correlation as this. 

” Tinbergen, “Statistical Testing of Business Cycle Theories,” Vol. II, Business Cycles in the United States 
of America, 1919-1932, League of Nations, Geneva, 1939. 

" Oreutt, “A Study of the Autoregressive Nature of the Time Series Used for Tinbergen’s Model of the Eco- 
nomic System of the United States, 1919-1932,” Journal of the Royal Statistical Society, Vol. X, Series B (1948), 1. 

% Gartaganis, “Autoregression in the United States Economy, 1870-1929,” Econometrica, 22 (1954), 228. 
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that correlation coefficients large enough to pass ordinary significance tests 
were generated. He also showed that he could, by a process of successively elim- 
inating variables, end up with a single “explanatory” variable (corporate 
profits) ; that all the other variables in his system could be expressed as a func- 
tion of corporate profits; and that corporate profits could be closely approxi- 
mated by a fourth-order linear difference equation. 

At the time Tinbergen wrote, less was known than now about the properties 
of such systems. A decade later, Orcutt undertook a comparison of the auto- 
correlation present in Tinbergen’s series (for lags of 1-4 years) with that in a 
Monte Carlo experiment, using data generated by autoregressive functions 
containing a random term. He concluded that the autocorrelation in Tinber- 
gen’s series did not resemble the autocorrelation to be expected from fourth- 
order linear autoregressive equations, but instead they resembled drawings 
from a population of series generated by a single second-order linear stochastic 
difference equation, the elements of which had different non-homogeneous 
terms. 

Our results differ from Orcutt’s. The average autocorrelation in our data is 
consistently higher than in his (for lag of one year, his mean is .597, while ours 
(Table A-I) is .837; for lag 4, his mean is —.125, ours .525). Our variance and 
his are about the same for lag of one year (ours .040, his .048), but ours rises 
steadily with the lag (for lag 4, our variance is .158, his .035), and his does not. 

Gartaganis is in part concerned with differences in the autocorrelation proper- 
ties of production series belonging to different sectors of the economy and to 
different periods. He finds that there are, in general, notable differences be- 
tween autocorrelation in 1870-1913 and in 1914-1929. Within each period, 
average correlation for a given lag does not differ greatly among agriculture, 
mining, and industry. Whereas the distributions in the first two sectors are 
homogeneous, in the last, they appear to be messy, and Gartaganis does not 
treat industry in as much detail as the other sectors. 

It is, of course, clear that a series’ autocorrelation in a subperiod will not 
necessarily equal the autocorrelation in an entire period; and that if a series has 
stable autocorrelation properties, it will still vary from one interval to the next 
for sampling reasons. 

Our results show, on the average, less autocorrelation in the 25-year period 
1929-53 than Gartaganis’ in the 44-year period 1870-1913; They show more, 
on the average, than Gartaganis’ in the 15-year period 1914-1929. 

Disregarding the fact that Gartaganis uses different (and not randomly se- 
lected) series from ours, we might conjecture either that the longer the time 
period studied, the greater the observed autocorrelation in economic series as 
would follow from the properties of the autocorrelation coefficient as a function 
of the number of observations, or that the interwar period was a disturbed one 
in which pre-1914 economic properties were disrupted, and post-depression 
properties had not been established. On the second conjecture, 16-year auto- 
correlations for the period before 1914 would be greater than 16-year autocorre- 
lations for the period since 1932; and these, in turn, would be greater than those 
Gartaganis saw for 1914-29. Such a view would certainly match the intuition 
of many economic historians and be consistent with a certain amount of sta- 
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TABLE 2. DETERMINANTS OF AUTOCORRELATION COEFFICIENTS 
FOR 100 RANDOMLY SELECTED ECONOMIC TIME SERIES 








(Natural Numbers, not corrected for trend) 





Number of Determinant 
Values Less Than 


Order of Autocorrelation Function* 





2 


3 


4 


5 





.001 
.O1 
1 


19 
46 
73 





46 
64 
82 





60 
74 
91 





68 
83 
94 





(Logarithms, not corrected for trend) 





Number of Determinant 
Values Less Than 


Order of Autocorrelation Function* 





2 


3 


4 


5 





.001 
-O1 
mS 


15 
43 
72 








43 
65 
85 





59 
73 
90 





69 
83 
93 





(Natural Numbers, corrected for trend) 





Number of Determinant 
Values Less Than 


Order of Autocorrelation Function* 





2 


3 


5 





-001 
-O1 
a 


2 
12 
42 


15 
30 
61 








40 
56 
81 





(Logarithms, 


corrected for trend) 





Number of Determinant 
Values Less Than 


Order of Autocorrelation Function*® 





2 


3 


4 


5 





.001 
-O1 
1 


6 
44 





8 
30 
60 


27 
52 
76 





42 
65 
85 





® To an autocorrelation function of nth order corresponds a determinant of order n +1. The squares of the first 
degree functions are discussed in the text. 


tistical evidence.“ We have no evidence as to whether either of these is correct. 

We have mentioned the discussion by these writers of autocorrelation func- 
tions involving several Jags in the variables involved. Such functions generate 
determinants whose components are lagged autocorrelation coefficients of the 
type we have discussed. Our data permit some further remarks on the extent 
to which economic data resemble those generated by higher-order linear auto- 
correlation functions. 





4 See Ames, “Trends, Cycles, and Stagnation in U. 8. Manufacturing Since 1860,” Oxford Economic Papers, 
Vol. XI, 1959, p. 270. 
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In correlation analysis, it is shown that the determinant of a matrix of corre- 
lation coefficients vanishes when one of the variables is a linear combination of 
the others. The value of this determinant, moreover, can be interpreted as the 
percentage of the variance of one of the variables which cannot be explained 
by changes in the other variables. We can associate with a linear difference 
equation of order n a matrix of order n+1, whose entries are autocorrelation 
coefficients, and whose determinant vanishes." Since we have autocorrelation 
coefficients for lags 1-5, we may readily find out how many of these autocorre- 
lation matrices of order 1-5 actually approach singularity.” Such data would 
indicate the proportion of time series that it would be possible to approximate 
to the desired degree of precision by an exact linear difference equation. Our re- 
sults are given in Table 2. 

It is clear that the data not adjusted for trend can, in large part, be approxi- 
mate by linear difference equations of low order; trend-corrected data require 
higher-order equations to achieve a given degree of approximation. 

These findings suggest that our series more nearly satisfy a linear determin- 
istic process than they do a linear stochastic one. However, the findings still 
allow our series to satisfy, for example, a linear difference equation with non- 
constant coefficients, provide these coefficients change slowly over time. Thus, 
the coefficients might be generated by a “slow” stochastic process, or perhaps 
change discontinuously at relatively long intervals." If these findings and inter- 
pretations are correct, they also suggest the possibility that shock models (dis- 
turbances in the equations) may not fit annual economic data very well. 





4 Wold, A Study in the Analysis of Stationary Time Series, Second Edition, Uppsula, Almquist & Wiksell, 
1954, p. 45, Theorem 2. 

4% Even if economic time series were actually generated by linear difference equations, errors of measurement 
alone would prevent us from observing empirically the vanishing of autocorrelation matrices. 

* This point arose in a discussion with R. Radner. 
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TABLE A-I. FREQUENCY DISTRIBUTION OF THE CORRECTED 
AUTOCORRELATION COEFFICIENT IN 100 RANDOMLY 
SELECTED ECONOMIC TIME SERIES, 

FOR VARIOUS LAGS 


(Natural Numbers, not corrected for trend) 








First Digit of Lag of Lag of Lag of Lag of Lag of 
r, unrounded 1 year 2 years 3 years 4 years 5 years 





56 38 21 16 10 
17 17 24 18 20 
08 08 09 16 11 
06 08 02 08 
06 09 04 
03 02 06 
08 07 
07 08 
07 06 
03 02 
04 04 
04 10 
02 00 
03 
00 
01 
00 
00 
00 
00 


>I © 


os 
i) 


Sr a 
2 
S 


0 


SSS885 


- 0 
a 
2 
3 00 
4 
5 
6 


“I 











SSSs8 


oo 





© 
= 





Mean 

Moments about Mean | 
Second 
Third 
Fourth 


Bi 
Ba 
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TABLE A-II. FREQUENCY DISTRIBUTION OF THE CORRECTED 
AUTOCORRELATION COEFFICIENT IN 100 RANDOMLY 
SELECTED ECONOMIC TIME SERIES, 

FOR VARIOUS LAGS 


(Natural Numbers, corrected for linear trend) 





First Digit of Lag of Lag of Lag of Lag of 
r, unrounded 1 year 2 years 3 years 4 years 





00 00 
01 01 
00 00 
00 00 
04 00 
11 00 
12 00 
24 20 
13 17 
11 15 
09 
12 20 
02 
01 03 
00 03 
00 00 
00 00 
00 00 
00 00 
00 00 


16 
22 
23 
12 
07 
04 
05 
04 
06 
01 


_ 
~— 


SSSSESESE 





oO 
~I 


Mean 

Moments about Mean 
Second 
Third 
Fourth 





By 
Be 
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TABLE A-III. FREQUENCY DISTRIBUTION OF THE CORRECTED 
AUTOCORRELATION COEFFICIENT IN 100 RANDOMLY 
SELECTED ECONOMIC TIME SERIES, 

FOR VARIOUS LAGS 


(Logarithms, not corrected for trend) 








First Digit of Lag of Lag of Lag of Lag of 
r, unrounded lyear | 2 years 3 years 5 years 








55 28 
15 25 
14 08 
04 07 
04 08 
00 09 
01 05 
00 03 
05 00 
Ol 02 
00 02 
01 





00 
00 00 
00 
00 00 








9 
8 
i 
6 
5 
4 
3 
2 
PY 
0 
0 
1 
2 
3 
4 
5 
6 
Pe 
8 
-.9 





Mean 

Moments about Mean 
Second 
Third 
Fourth 





Bi 
Bz 
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TABLE A-IV. FREQUENCY DISTRIBUTION OF THE CORRECTED 
AUTOCORRELATION COEFFICIENT IN 100 RANDOMLY 
SELECTED ECONOMIC TIME SERIES, 

FOR VARIOUS LAGS 


(Logarithms, corrected for linear logarithmic trend) 








Lag of | Lagof Lag of Lag of Lag of 


1 year 2 years 3 years 4 years 5 years 


First Digit of 
r, unrounded 


! 
| } | 
| 
} 
| 
| 


| 


06 00 00 00 00 
25 oo | 00 00 00 
29 07 00 00 00 
13 09 00 00 00 
06 10 03 00 
11 05 00 
04 08 07 01 
18 23 13 
05 21 18 13 
03 5 04 12 
| 17 25 
02 18 21 
05 
00 07 
00 
00 
00 
00 
00 
00 


woworaasce S| 





—_ a 


to 


w 


ao 


io 





Mean 

Moments about Mean | 
Second 
Third 
Fourth 














* Absolute value less than .0005. 
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TABLE B-I. FREQUENCY DISTRIBUTION OF THE COEFFICIENT OF 
CORRELATION BETWEEN TIME (INTEGERS 1, 2,---, 25) AND 
EACH OF 100 RANDOMLY SELECTED 
ECONOMIC TIME SERIES 








First Digit of r, unrounded Natural Numbers Logarithms 





22 28 
23 16 
8 10 


2 
2 
2 
1 


Noe Wh Dh Ww OI 


_ -_ 

or) — dw © 
= oS 
a) 


ms] 
eee 
| 
| 





Mean 

Moments about Mean 
Second 
Third 
Fourth 


Bi 
Be 
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TABLE C-I. PERCENTAGE DISTRIBUTION OF THE CORRECTED 
COEFFICIENT OF CORRELATION IN 50 PAIRS OF RANDOMLY 
SELECTED ECONOMIC TIMES SERIES, FOR VARIOUS 
LAGS IN THE INDEPENDENT VARIABLE 


(Natural Numbers, not corrected for trend) 





First Digit of Lag of Lag of Lag of Lag of 
r, unrounded 0 years 1 year 3 years 











9 08 06 06 

8 06 08 04 
08 12 
12 
02 
06 
06 


aan 


rm bo 0O 


~ a 
eS eS 


Ccanourk wn — 








Mean 

Moments about Mean 
Second 
Third 
Fourth 


Bi 
Be 
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TABLE C-II. PERCENTAGE DISTRIBUTION OF THE CORRECTED 
COEFFICIENT OF CORRELATION IN 50 PAIRS OF RANDOMLY 
SELECTED ECONOMIC TIME SERIES, FOR VARIOUS 
LAGS IN THE INDEPENDENT VARIABLE 





(Natural Numbers, corrected for linear trend) 





First Digit of Lag of Lag of Lag of Lag of Lag of Lag of 
r, unrounded 0 years 2 years | 3 years | 4 years | 5 years 





02 00 
06 00 
06 00 
00 
10 08 
08 
06 





14 
08 
10 
08 
06 
04 
00 


00 00 
04 00 
00 00 
00 00 





Mean .158 P ; .018 
Moments about Mean 
Second .178 E 335 
Third — .016 J — .001 
Fourth .082 ; .031 





Bi .047 _—* | .001 

















2.154 2.098 








Be 2.605 


* Absolute value less than .0005. 
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TABLE C-III. PERCENTAGE DISTRIBUTION OF THE CORRECTED 
COEFFICIENT OF CORRELATION IN 50 PAIRS OF RANDOMLY 
SELECTED ECONOMIC TIME SERIES, FOR VARIOUS 
LAGS IN THE INDEPENDENT VARIABLE 


(Logarithms, not corrected for trend) 








First Digit of Lag of Lag of Lag of Lag of Lag of 
r, unrounded 0 years 1 year 2 years | 3 years | 4 years 





02 02 00 04 
10 12 12 08 
08 04 06 
14 12 12 08 
02 10 08 
04 
02 
10 


5 
4 
3 
2 
1 
0 
0 
1 
2 
3 
-.4 
5 
- 6 


~“J 


Coo 














Mean .127 .12! .108 
Moments about Mean | 
Second = | mee 6S er .339 
Third |—. —.089 |-. ; |— .056 


Fourth ‘ ae OF |. | .205 











By 167 | .154 9 | 082 
Bs .781 | 1.706 | 1.786 
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TABLE C-IV. PERCENTAGE DISTRIBUTION OF THE CORRECTED 
COEFFICIENT OF CORRELATION IN 50 PAIRS OF RANDOMLY 
SELECTED ECONOMIC TIME SERIES FOR VARIOUS 
LAGS IN THE INDEPENDENT VARIABLE 


(Logarithms, corrected for linear logarithmic trend) 


First Digit of Lag of Lag of Lag of Lag of Lag of 
r, unrounded y 1 year 2 years | 3 years | 4 years | 5 years 











00 00 00 00 00 
02 00 00 02 00 
04 02 02 00 02 
00 02 00 02 00 
04 =| 06 00 
04s 02 04 
02 08 
10 | 14 22 

20 08 

12 

is | eS 18 

14 | 

04 
04 

















NO os 


oe 


: 
| 


© 


Pics antiraedan 2 ees 








Mean 

Moments about Mean 
Second 
Third —s 
Fourth | .046 





B, as 
Bs | 3.233 














® Absolute value less than .0005. 


TABLE D-I, PERCENTAGE DISTRIBUTION OF THE MAXIMUM ABSOLUTE 
VALUE OF THE CORRELATION COEFFICIENT OVER LAGS 0-5, FOR 
50 PAIRS OF RANDOMLY SELECTED ECONOMIC TIME SERIES 








Natural Numbers Logarithms 





First Digit of 
r, unrounded Not corrected | Corrected for | Not corrected 
for trend linear trend for trend 


Corrected for 
linear loga- 
rithmic trend 





32 24 36 
30 34 26 

2 2 4 
20 34 10 
14 2 24 
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TABLE D-II. PERCENTAGE DISTRIBUTION OF THE MAXIMUM ABSOLUTE 
VALUE OF THE CORRELATION COEFFICIENT OVER LAGS 1-5, FOR 
50 PAIRS OF RANDOMLY SELECTED ECONOMIC TIMES SERIES* 











Natural Numbers Logarithms 





First Digit of 
r, unrounded Not corrected | Corrected for | Not corrected 
for trend linear trend for trend 


Corrected for 
linear loga- 
rithmic trend 





7to .9 30 20 30 8 
.2 to .6 30 32 32 30 
ie SS 2 6 4 12 
—.2to —.6 24 38 12 40 
= .7te =2 12 4 22 8 

















® In some cases, the total may be less than 100 per cent. This is because two correlations were of absolute value 
equal to the maximum, but of opposite sign, and could not be entered in this table. 


TABLE D-III. AVERAGE ABSOLUTE VALUE OF THE CORRELATION 
COEFFICIENT IN 50 RANDOMLY SELECTED PAIRS OF 
ECONOMIC TIME SERIES, FOR VARIOUS LAGS 











Natural Numbers Logarithms 





Lag in Years ; Corrected for 
Not corrected Corrected for Not corrected linear loga- 


for trend linear trend for trend diake wna 





571 .385 .573 . 283 
559 B86 583 .303 
548 .362 .583 312 
518 .327 558 .284 
.508 .306 545 | .283 
.499 290 541 269 

















TABLE D-IV. AVERAGE NUMBER OF RANDOM TRIALS NEEDED TO 
FIND AN ECONOMIC VARIABLE ‘EXPLAINING’ HALF OR 
MORE OF THE VARIANCE IN A GIVEN VARIABLE 








Natural Numbers Logarithms 





Corrected for 
linear loga- 
rithmic trend 


Not corrected Corrected for Not corrected 
for trend linear trend for trend 





Lag 0 3 5 plus P 12 
Lags 0-5 max. 2 plus 4 minus : 5 plus 
Lags 1-5 max. 2.5 4 plus 6 




















FITTING OF STRAIGHT LINES AND PREDICTION WHEN 
BOTH VARIABLES ARE SUBJECT TO ERROR 


Max HAcreERIN 
General Engineering Laboratory, General Electric Company* 


Define the linear structural relation with parameters a, 8, by 
u=a+yr. This paper considers the problem of confidence region esti- 
mation of a and § based on a sample of independent pairs, (y;, 2z;), 
i=1,2,--+-,n, with Ey;=y;, Ex;=v»;. The pairs (y;, 2;) are assumed 
to have a common (unknown) covariance matrix and to follow the 
bivariate normal distribution. Point estimates 


6 = > liys > Lz., a= 7 — Bz, 
i-l i=l 
subject only to 


~k =0, > lt =1 
tol t=—l 

and that the choice of the 1; be independent of the actual observations 
are shown to lead to an exact confidence region for (a, 8). The physical 
size of the region and whether or not the region is closed depend strongly 
on the choice of the l;. Using as a criterion the maximization of the 
probability that the region is closed, it is shown that the optimum J; 
require knowledge of the »;; however, the region is at the required level 
even if the J; are sheer guesses. Of course, if one has a priori information 
about the »; levels (presumably not completely accurate) it can and 
should be used. The use of auxiliary experimentation to improve the 
choice of the 1; is briefly discussed and it is shown that near optimum 
regions can be expected by such a device providing measurement error 
is small relative to the variation in the »;. The question of error control 
in predicting a number of new »;’s from corresponding new y;’s on the 
basis of a single estimate of (a, 8) is also considered. It is suggested 
that instead of controlling the proportion of errors, one control instead 
the probability that the proportion of errors will not be more than some 
specificd amount; intervals based on this principle are derived for sev- 
eral different assumptions about the kind of information available. 


1. INTRODUCTION AND SUMMARY 


HE investigation of this paper was motivated by the following frequently 
yeah situation: 

Over a period of time a laboratory will be presented with a number of test 
pieces and asked to estimate some characteristic of each piece. A direct de- 
termination of the characteristic may be difficult, expensive or time-consuming. 
Because of this the laboratory may have searched for and found an alternative 
measurement which is cheaper, quicker or less difficult; the laboratory then 
performs a calibration experiment to ascertain the relationship between the 
two systems of measurement and subsequently will estimate the “true value” 
of the characteristic of interest for each of a number of test pieces (not in- 
cluded in the calibration experiment) on the basis of the estimated relationship 





* This paper was prepared while the author was a Statistician with the Advanced Development Activity, 
Knolls Atomic Power Laboratory, Schenectady, New York. 
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and the measured value of the “easy to get” variable for each test piece. It is to 
be emphasized that on the basis of a single calibration experiment one will 
generally estimate the “true” values for a number of different test pieces. 

If one is prepared to assume the form of the relationship, at least over a 
range, two problems arise: 


(a) Estimation of the parameters of the relationship and the precision with 
which the parameters are estimated. 

(b) Estimation of “true” values based on (a) and values of the “easy to get” 
variable and the precision with which the “true” values are estimated. 


We consider these problems for the situation where the relationship can be as- 
sumed to be linear. 

In connection with problem (a) estimates of the parameters and associated 
confidence regions are suggested for the situation where the observed values of 
the independent variable contain an error component but the experimenter 
either feels able to guess a priori at the corresponding true values or it is 
feasible to do an auxiliary experiment to generate guesses. It is also shown that 
the closer one’s guesses are to the true values of the independent variables the 
more likely it is that the confidence region for the parameters is closed; it is 
further shown that the probability of parameter points other than the true one 
being covered by the confidence region also depends on the quality of one’s 
guesses. If one makes perfect guesses, the probability of coverage of a point 
other than the true parameter point is uniformly less than the probability of 
coverage of the true parameter point; even in this case, however, it is possible 
to find a parameter point an infinite distance away from the true parameter 
point which has non-zero probability of being covered for small samples. The 
confidence region operating characteristic for perfect guesses, just defined, is a 
lower bound for similar operating characteristics when guesses are not perfect. 
Of course, deviations from the lower bound are greater as guessing gets worse. 
Thus even with excellent guesses one would expect the probability of coverage 
of parameter points in some neighborhood of the true parameter point to 
exceed the probability of coverage of the true parameter point. This only em- 
phasizes the importance of the quality of one’s guesses. [t might be mentioned 
that this last point reflects upon the estimates and confidence regions given 
by Wald [8] and Bartlett [2] since their estimates are special cases of the 
estimate of this paper corresponding in fact to less than optimal guessing. The 
question of consistency of the estimates is also discussed. 

It might be emphasized even though we have spoken of guessing the true 
values, that in calibration experiments one will usually be making a conscious 
effort to space the independent variables over a range. Thus the guesses may 
frequently be more like “a priori” information. 

For problem (b), whether or not the independent variables are subject to 
error, the experimenter is usually concerned with controlling the proportion of 
incorrect confidence statements where the individual predictions made for 
each sample calibration curve are counted individually. More specifically, the 
experimenter would like to have some control over the number of erroneous 
confidence statements per calibration curve. We suggest a procedure for con- 
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trolling error (but not in the usual confidence sense) which is in the spirit of the 
discussion of this problem by Acton [1]. 


2. ESTIMATION OF THE PARAMETERS OF A LINEAR RELATION 


Consider the functional relationship: 
h=at By, (2.1) 


where y and v are non-random variables. Suppose u is observed as y=yu+¢ and 
that v is observed as x=v+n where 


Ey = g, 
Vary =o2 Var x = a, (2.2) 
Cov ry = pe.0y, 0< p? <1. 


Also suppose that x and y are bivariate normal including the degenerate case 
o,°=0, in which case v is known without error. 

We assume that we have a sample (21, y:) - - - (n, Yn) of independent vectors 
with Ex;=v;, Ey;=y; and variances and covariance as above. 

Various solutions to the point and confidence interval or region estimation 
of (a, 8) have been given; point estimation is comprehensively reviewed in a 
recent paper by Madansky [6]. The solution derived here is intimately related 
to the solutions by grouping offered by Wald [8], Bartlett [2] and others and 
also to solutions by the use of instrumental variables described by Geary [5] 
and other writers. 

Our solution to the problem posed above proceeds as follows: 

Define variables z; by 


zZ=y-a-—ft, t=1,2,-+-,M. (2.3) 


Under the assumptions we have made the z; are independent normal variates 
with zero means and common variance o? = ¢.— 28p0.0,+ B’a,,. 
We define new variables w,; and we: by 


1 n 
wi = Fe da 


Ww: = ya lzi, 
1 


Di=0, DPi=1 
1 1 


and the 1; are determined independently of the sample data—or at least of 
those sample data which are random variables. Thus w, and w, are independent 
and we know from matrix theory that we can define w;, w, - - - , w, so as to 
obtain an orthogonal transformation of all the z;. If we define 


n — 2 wy? + w?? 
Fen—-2 = : : (2.5) 


2 > ae 
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we can then assert that F:,,-2 has Snedecor’s F distribution with 2 and (n—2) 
degrees of freedom, and consequently the inequality 


Pon—2 S Fon-2(y) (2.6) 


where F2,,-2(7) is the value of F:,,-2 exceeded with probability y, provides us 
with a 100(1—)% confidence region on (a, 8). (2.6) is analagous to the con- 
fidence region for (a, 8) given by Wald [8] and is in fact implied in his discus- 
sion. In view of the essentially arbitrary nature of the J; it is natural to ask how 
they should be chosen; in addition it is natural to ask whether the region just 
defined is a useful region, i.e. is it closed and what is its shape? We consider the 
second question first. 
For notational simplicity write: 


eile = Paa-s(0, Ses = (1 + R)U(a)I() — RScs, 
ere 


l(t) = A Lit; p* $,,/P,2, 
1 


Sa= dS (@:-)G%-5, f= 1@)AG). 


With this notation we can write our confidence region, after some simplification, 
as 


ny gio ae OR Bz)? + ,,(8* i 8)? < B**4,, et yy. (2.8) 


It is easy to show that if ,,>0, the inequality above describes an elliptical 
region, while if #,.<0, the region will be hyperbolic and therefore open and 
frequently not useful. 

It is to be noted that, if ¢,2=0 so that the »; are known without error and we 
can properly define 1;=(»;—¥)/S,,"", our confidence region becomes identical 
with the region following from least squares estimation of a and 8 and nor- 
mality, homoscedasticity and independence of the ¢;. 

Now we turn to the question of how to choose |, l:, - - - , ly. In our problem 
it appears natural to choose the J; so as to maximize the probability that 
$,,>0; ie. maximizing the chance that our confidence region will be closed and 
therefore useful. Maximization of Pr{,2>0} can be shown to be equivalent 
to maximizing Pr{ T>R } where T' is the ratio of two non-central x? variates 
with degrees of freedom 1 and n—2 and non-centrality parameters /*(v)2¢,? 
and [S,,—/*(v)]/2e,? for the numerator and denominator, respectively. Thus 
one can write 


Pr{T > R} = Pr} > Rt (2.9) 


wi? + wet +--+ + wis 


where ¢, and the w; are normally and independently distributed with 


l /Sm — Pv) 
Et, = ll Ew, = Vea ’ 


oy oy 
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Ew; = 0, $=22,-+-,98—-2 


Var t, = Var w; = 1, @#=1,2,---,n—2. 


If now one makes the transformation 
i; = rsin @ 


ts = rcos 8, 


“ 


one finds that the density of r, 0, we, - - + , Was is given by 


n—2 


Dw? 


2 ¢ r? 2 
—r cosh [rS,,'/? sin ¢] exp — yew 


(2x)"—1/2 2 


where 


n—2 
t = 6+ cos~'[l(v)/S,,"/?], / >: wird eo,—27 <I wMy< ow, 


and 





/ - — 2 a } 
w; 9 
; R | - 
sin! ee" -+1|/ <@<2-—sin™ — +H: 


1+R\ Fr } 1+R r? 


Now for fixed r and S,,, cosh [rS,,“/? sin £] is monotone in | sin ¢| . In addition it is 
apparent that as cos~[l(v)/S,,"/?] increases from zero, integration of (2.10) on 
@ is over a rotated constant angle and that for every r, we, - ++, Was, the 
minimum value of | sin ¢| included in the integration interval will only be re- 
duced as cos~' [I(v)/S,,/?] departs from either zero or +. Thus Pr{ #,.>0} is 
maximized by taking /*(v) =S,,, that is 1;=(»;—7)/S,,""*. It might also be men- 
tioned that this choice of the /; also maximizes E®,, which, crudely speaking, 
plays the role of the variance of 6*. Hence the criterion max Pr |{ ,,>0} tells 
us to guess the v; as shrewdly as possible in advance of the experiment and de- 
fine the 1; as above regardless of the outcome of the experiment. It is to be 
emphasized that regardless of the quality of the guess we will have a valid con- 
fidence region in the sense that the confidence probability is precisely (1—a). 
If, for example, the experimenter feels that he can use the Wald or Bartlett 
procedures he probably has a fair idea, in advance, of the values of the in- 
dependent variable; if this is the case, he is well advised to guess the »;. 

Now we turn attention to the performance characteristic of the confidence 
region (2.8). That is, we ask for the probability that a parameter point, (a’, 
8’) say, different from the true parameter point (a, 8), be included in the region 
(2.8). This probability can readily be seen to be 1—Pr{7>R} where T is the 
ratio of two non-central x? variates with degrees of freedom 2 and n—2 and 
non-centrality parameters 


n[(a — a’) + (8 — BP]? + B- Bye) 6 — 6’)*[S,, — F(r)] 
20,7 2c)? 





(2.11) 





662 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


where o,?=¢.2+2(pe.—Bo,)(8—8’)o,+(8—8’)*c,?, for numerator and de- 
nominator, respectively. It is readily seen by the same type of argument used 
in determining max Pr{#,.>0} that 1—Pr{7>R} is a minimum for given 
a, 8B, (a—a’), (8—8’) when /*(v) =S,,. In this case it is apparent that the prob- 
ability of coverage of (a’, 8’) is uniformly less than the probability of coverage 
of (a, 8). By fixing (2—a’) and B and letting (6—£’)*—> =~, we see that in this 
limiting situation (2.11) becomes 

ne ian (2.12) 

20,’ 

so that, even in this most favorable case, there is finite probability, for small 
samples, of including parameter points in the region (2.8) which are infinitely 
distant from (a, 8). It is also to be noted that if l(v) =0 and (a—a’) = (8—8’),, 
then the probability of coverage of (a’, 8’) is uniformly greater than the prob- 
ability of coverage of (a, 8). It is thus clear that for any choice of the 1; other 
than the optimum there exists a neighborhood of (a, 8) for which the prob- 
ability of the region (2.8) containing parameter points in the neighborhood is 
greater than the probability of the region containing (a, 8) and that the 
neighborhood enlarges as l(v)—0. This only underlines the importance of 
choosing the 1; as close to optimal as possible. 

Since we are in the situation where a and § are non-identifiable (see Madan- 
sky [6], p. 180) it is of interest to see how good one’s guesses of the v; must be 
in order to achieve consistent estimates of 8 (and, therefore, a). Actually, in 
our discussion, we have mentioned two types of estimates of 8, 8 and 6*, the 
latter arising as the center of a confidence interval for 8. We consider both 
types of estimates and define Bg and B¢* as estimates of type § and 6*, re- 
spectively, with coefficients (v:¢—¥e)/Se,,, where the extra subscript G implies 
the guessed nature of the »;¢; 8 and 6* will be taken to involve the true 7. 

We have then, using the notation of (2.7), 


Say 


Savz 


Be 


Now define 


v / i /2 
Sevy lS avy 


1/2 
Sar: /Sew. 





ENe —— pve, »), 
Sew» 


B Lie — Dg) (vi — P) i a4/ Ss. 


Sew 
2 
Te 
Var Ng = —> 
Sew 


where p(ve, v) is the correlation between the vig and »;. Thus if the correlation 
between the »;¢ and »; is non-zero and the variance of the v;¢ is bounded from 
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below, away from zero, Ng will converge in probability to EN <0. 
Similarly, 
Sy» 
EDe = oe p(va, v), 
Gy 
2 
Var De = ’ 
Sen 
so that under the same conditions as just noted Dg converges in probability to 
ED;<#0 and it follows that Bg will be consistent. Unless one is making sheer 
random guesses, the conditions above are sure to be met. A similar discussion 
holds for B*g. 

It is interesting to observe that even though we have talked about guessing 
the »;, the validity of the confidence region is not affected if we obtain the r;¢ 
by doing an independent auxiliary experiment to estimate the v;; the validity 
of the results will be of course in the conditional sense, i.e. given the particular 
set of outcomes of the auxiliary experiment. The auxiliary experiment might 
consist of mere replication of the measurement procedure on the identical 
experimental material or a re-run of the entire experiment involving new ex- 
perimental material. Note also that it is not crucial for the auxiliary experiment 
to involve exactly the v; levels of the rest of the calibration experiment but 
only roughly the same levels. Use of this auxiliary experiment can be expected 
to generally give near optimal small sample results. We say expected since the 
outcome of the auxiliary experiment is of course a random variable so that we 
can only assess its usefulness in a probabilistic sense. 

Let mc, °*+*, ma be the results of the auxiliary experiment and suppose 
Evig=v;i, Var vig=a,?. Then, according to previous discussion, one will be near 
optimal both in terms of the performance characteristics of the confidence re- 
gion and the probability that the region is closed if the quantity 


> (vie = de)vi/V [> (via = de)? [Do(% a 7)?] = p(vq, v) 


is near unity. If we transform the »;¢ by an orthogonal transformation of which 
the first two components are 


Eo. ana/4/ Ee 9° 





ts > vie/ Vn, 
I 


it is readily verified that after an appropriate change of scale, 


21 
p(vo, ») = <== ? 


n—1 


4/2 + Die? 
2 





where, unconditionally, 2:, z:, -- +, Z, are independent normal variates with 
Ez;=0,i=2, ---,n, 2a=S,,""/o,, Varz;=1,1=1, 2, --- ,n. Thas 
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/ p*(ve, vy) ne 21 Tz ta—2 
1 — p*(va, ») < vn—2 
5 2 
2 








where ¢,—2 is a non-central Student’s “?” variate with non-centrality parameter 
S,,/2/o, and (n—2) degrees of freedom. We have 


C5 — 7 
T a 
t. l 


, / & 1/2 Co cuneate: Gamieeneie py ll? ® 
ES = Bni!"/on) 3 a(S = Fay Swit) 





where the approximation is off by about 23% for n=5, and steadily increases 
in accuracy with n. This corresponds to 


+1 
p(va, ¥) = (2.13) 


fae 


It is evident from (2.13) that | p(ve, v)| will tend to be very close to unity in 
any practical situation, i.e. where the spread in values of the independent 
variable is quite large compared to measurement error. The point could be 
made more precise by numerical investigation of the probability distribution of 
p(vc, v) but this hardly seems necesary. 

It is also worth noting that we can write 


.. = [(1 + R)p%(lz) — RJ Sze 


where p(lz) =1(x)/+/S,, is simply the correlation coefficient between the 1; and 
the z;. Thus the condition #,,>0 requires 








ers 


> cusmuswaitiit> © 
|p| ee 


R : F 

n—2 sma), 
it is clear that for n small the choice of the /; is of major importance in obtaining 
a closed region while for n large, since R--0, the region will be closed for almost 
any set of l;. However, in any event, it is desirable to have | p| as large as 
possible so as to maximize #,, and consequently minimize confidence interval 
length. Note that | | =1 gives the shortest confidence interval and can only be 
attained when ¢,?=0, i.e. when only the y; are subject to error. This is gratify- 
ingly in accord with intuition. One implication of the above is that we can 
consistently expect that either the Wald or Bartlett procedures, when ap- 
plicable, are not too useful for small n. The reason for this is that in both proce- 
dures the J; are chosen without any real consideraticn of what the individual 
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x; are likely to be and in addition the /; are chosen equal in groups. There would 
certainly be some correlation expected because of the ordering of the x groups 
in these procedures, but one could hardly expect more than modestly high 
correlation. To clarify ideas, it may be useful to see the correlation required to 
have a closed confidence region for various sample sizes and confidence coef- 
ficients. Results of the necessary calculations for several choices of y and n are 
given in Table I. 


TABLE I. MINIMUM VALUES OF |p(lz)| REQUIRED FOR THE 
CONFIDENCE REGION ON (a8) TO BE CLOSED 


1—y=confidence level; n =sample size 








3 4 5 10 20 42 





.9999 .9950 -9765 .8269 -6328 -4535 
.9982 9747 .9294 -7261 .5318 .3522 
.9950 . 9487 .8857 -6614 .4748 -3205 
.9798 .8944 .8116 .5754 .4044 . 2784 























An obvious application of the above results is in the case where the inde- 
pendent variate is “controlled” to a series of nominal values. For this case, as 
pointed out by Berkson [3], we can fit by the usual least squares procedure if 
we use the nominal rather than actual values of the independent variate; the 
validity of such a procedure depends upon the assumption that the observed 
values of the independent variates are unbiased estimates of the nominal 
values. The results of this paper would suggest, for this case, defining the J; 
in terms of the nominal values of the independent variate. This would not be 
feasible in biological applications where one will usually know only the nominal 
values of the independent variate. However, in calibration problems in chem- 
istry, for example, one will frequently know both nominal and measured values 
of the independent variate. In such cases one can thus avoid the unverifiable 
assumption of unbiasedness of the observed values of the independent variate 
relative to the nominal. 

As further points, we observe that the discussion above, with obvious 
changes, applies to a confidence interval for 8 based on 


Pe (n — 2)[U(y) — 6U(z)}? 

Sw — 288zy + B*Se2 — [U(y) — Bl(z)]*” 
and also a confidence interval for a when 8 is known for any validly chosen set 
of 1;. 

We close this section with a numerical example. The data were supplied by 
A. W. Kimball of Oak Ridge National Laboratory as an example in which 
Bartlett’s method gives an imaginary confidence interval for 8. A confidence 
interval for 8 only is given by 


B**b,, — PD, B**d,, — Dy 
+ ———— <B< — 
“ / #.. ai ° 43 . * *.. 





where the notation is as given previously except that here R =(,?/(n—2) where 
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t, is the value of Student’s “t” with (n—2) degrees of freedom exceeded in 
absolute value with probability y. We take y=.05, so that R = 1.9265. 
The data and summary statistics were as follows: 











S22 =77.5741; Sey =39.1873 


Sy =19.7994; S@.e=77.5350 





Son =77.5000; Se, =39.1700 











Use of Bartlett’s method is equivalent in our notation to taking h=/,.= —4, 
ls = k =Q, ls = l= }. We thus find I(x) =%;-—%= 7.01 and ®,.. = —3.8690 so that 
the confidence interval is imaginary. Using Se.2=J/Senl(x), Sey= VSenl(y) 
we find #,,=77.5622, #,,=19.7932, $,,=39.1885, 8*=.5085; the “point es- 
timate” &¢=.5045 as compared with the estimate .5007 by Bartlett’s procedure. 
Finally 

| (BG**,_ — By)/¥z2]'/? = .0117 so that 


4936 < 8 < .5170 
with confidence .95 for the selected set of v;g. 


8. MULTIPLE PREDICTION FROM AN ESTIMATED LINEAR RELATIONSHIP 


In terms of the notation of section 2, the problem we wish to consider is that 
of prediction of an indefinite number of v’s, vyi, vy2, - - - , given corresponding 
estimates, yw, Yw2, °° +, Of uv, wwe, - ++, . We will also wish to make some 
sort of statement about the reliability of the estimates. This problem arises 
whether or not both variables (u, v) are subject to error and we consider both 
cases, assuming initially that only the dependent variables are subject to 
error (¢,?=0). 

The practical issue in the multiple prediction problem is a simple one. The 
experimenter would like to control the probability that some fixed number or 
more (or fixed proportion or more) of the total number of confidence state- 
ments made at a given confidence level are erroneous, where each confidence 
statement involves a new y but the same estimates of a and 8. 

As suggested by both Mandel [7] and Acton [1], if the number of predic- 
tions made on the basis of a single pair of estimates of (a, 8) becomes at all 
large, the type of error control described above leads to limits which are so wide 
as to be of no use to the experimenter. Mandel appears to suggest that the way 
out of this dilemma is to ignore it, that is one should use confidence intervals 
appropriate to controlling the error rate for a single prediction. Acton, on the 
other hand, in effect, suggests changing the type of assurance requirement. He 
suggests getting a tolerance interval for the distribution of y for each v (v 
presumed known). In this way one generates an envelope of tolerance limits. 
For any new y, one then finds the limits for the corresponding v as the extreme 
values of y on the envelope curve at the new y value, Except for the fact that 





FITTING AND PREDICTION WHEN VARIABLES ARE SUBJECT TO ERROR 667 


the confidence coefficient of the tolerance interval is somewhat ambiguous (since 
the tolerance intervals described by Acton are computed to be at a given 
confidence level for each v rather than all v) one then has a fixed level of con- 
fidence that each prediction interval is at least at the confidence level specified 
by the nominal coverage of the tolerance interval. 

The procedure we wish to suggest is in the spirit of Acton’s suggestion but 
somewhat easier to apply. 

Thus, consider the special case where co,’ is known. Suppose we have esti- 
mates @, 8 of a, 8 distributed in a bivariate normal distribution, with variances 
(taking o,.?=1) 


—-+ - 
n S, 


1 py 
v 


1 
yaa 
Bis 


and covariance —7/S,,. We will also have observations yi, +--+, ywe With 
unit variances and expected values a+ vy;,i=1, 2, - - - , k. If we consider the 
distribution of the yy; for given 4, 8, it is clear that we can obtain a confidence 
interval for each vy; at, say, confidence level (1— y) for the repetition set of the 
distribution of yy;. This confidence interval for vy; will simply be 
wirenb cy, ¢wmnath 
B B 

where 4 is the standardized normal deviate exceeded in absolute value with 
probability y. 

Since the yy; are independent, the chance that r or less of our k statements 
are in error can be calculated as 


De eCyyi(1 — yy". 

0 
It is to be noted that as k—> «© we approach 100% assurance that exactly 100~% 
of our statements are wrong. However, the confidence statements we have 
made contain nuisance parameters a and 8 and are thus not useful as they stand. 
A natural way to modify these statements is to replace a and @ in each of these 
statements in such a way as to insure that the new interval thus obtained in- 
cludes the interval it replaces. We cannot get 100% assurance of such an in- 
clusion but can obtain at least a 100(1—06)% assurance (0<@<1) by getting 
an upper limit to (yv;+t,—a)/8 and a lower limit to (yv;—t,—a)/B based on 
a 100(1—6)% confidence region on (a, 8). A straightforward calculation as 
described by Mandel [7] gives the upper limit for an expanded interval for 
yni as 


vni SP 
‘ (20 FL nh = ywi — t)? — 2x27 
} {3 — yNvi— t,) + V a x2?(8) [s / ( YN 7) X2 ( h 


‘ x Sw 
{B°S., baa 2x2%(6)} 


and a lower limit of the same form with yy;+t, replaced by yn;—t, and the 
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square root prefixed by a minus sign; here x27(@) is the upper 100@ per cent 
point of the chi-square distribution with 2 degrees of freedom, and # is the 
usual least squares estimate of £. 

We need to ask now what this type of interval does for us error-control-wise. 
If k-+«, we can say that we are 100(1—6)% sure that our error rate (in an 
infinite number of uses of a given calibration line) is not greater than 1007%. 
Obviously for some calibrations (1000% of them), our error rate will be higher. 
If k is finite, error control is not quite the same. In analogy with the infinite k 
situation, we can say we are 100(1—#6)% sure that the expected number of 
errors per calibration line is no greater than yk; if we want to phrase the matter 
in terms of protection against r or less errors out of k we can say that we are 
100(1 — @)% sure that the probability of r errors or less out of k is no more than 


Dd Ci — y)*%. 
0 


It should be emphasized that one is speaking conditionally here, i.e. for given 
a, B. 

Thus the procedure just described gives us a high probability of at least a 
given level of error control for an infinite number of predictions from the same 
calibration curve; if k is finite our assurance is on the expected level of error 
control. 

In the discussion above we have assumed that ¢,? is known. The more usual 
situation is that in which ¢,’ is estimated in the calibration experiment by 
S?., (sum of squares of deviations from regression divided by n—2) and used in 
all subsequent confidence interval estimates of unknown »’s based on the given 
calibration line. In this case the situation is somewhat more complex. To ob- 
tain a solution here we define variables 


t(§) a vn (j ae BP) /Sy.», 
t(8) = VS,, (8 — 8)/Sy-», 
ty; = (yi —- @= Bvyi)/Sy-», {= l, 2, © Sie a‘ 


The joint distribution of these (k+2) t-variates is a multivariate “t” distribu- 
tion with n—2 degrees of freedom (as derived in ref. [4] for example) whose 
covariance matrix is the unit matrix. If one defines new variables by 


12(4 t? 1/2 
ae n mf fig tet $=1,2,---,k, 
2 n-—-2 | 


one can show that the z; have a multivariate “t” distribution independent 
of t(y), (8) with n degrees of freedom and that each z,; has a Student’s “t” 
distribution with n degrees of freedom. Consequently, we can write for each 
vy; a 100(1 — y)% confidence interval as 


yni — a\? n—-2 f (9) + 28) 
¢ oe eee §. < 2 ey —sl1l ——————}, 
(>» i B ) al a t, (n)S, r + niete = \ 


where #,?(n) is the square of the value of ¢ with n degrees of freedom exceeded 
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in absolute value with probability y. Thus, conditionally, one has, per calibra- 
tion line, an average error control of exactly y. If k= «, we cannot in this case 
pass from average error control to exact error control per calibration line. This 
is because the common S,., induces a dependence among the probabilities of 
of error for statements on separate vy; so that even for k= © the variance of 
the proportion of errors out of the k statements is finite. This fact underlines the 
importance of having separate estimates of error in the repeated use of a single 
calibration curve; if such estimates were available the results for this case 
would be identical in nature, with obvious changes in form, with the case of 
known variance. Returning to the case of a common S,.,, we can, using 
t2(9) +02(8) =2F 2,2 and a confidence region based thereon get upper bounds 
on the upper litmits and lower bounds on the lower limits of our conditional 
confidence intervals at confidence level (1—6) of the same form as if the 
variance were known. 

Although the discussion above has been in terms of v’s known without error, 
it is clear that this is not an essential feature. The only real distinction when 
both variables are subject to error is that the basic error variance for the 
calibration constants and for new y values are different. Whether ¢.? is known 
or estimated (separately for each vy; or only once), the discussion we have 
given requires only trivial and obvious modifications due to the variance in- 
equalities just described. 
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THE OTHER SIDE OF THE LOWER BOUND. 
A NOTE WITH A CORRECTION* 


JosePrH BERKSON 
Mayo Clinic 


This note has as its primary purpose the correction of a statement 
made in a previous paper regarding the attainment of the variance of a 
proposed estimator of its lower bound. At the same time it seemed use- 
ful, as an expository function, to recapitulate a development of the 
lower bound which led this writer to a better understanding of it, and 
to the realization of the indicated error. Here the bound is developed 
ab initio from a simple identity, instead of secondarily from Schwartz’s 
inequality, as is usual. It should be stated that, mathematically, there 
is nothing essentially new in this development; however, it yields, in 
addition to a clarification of the “why” of the bound, a necessary con- 
dition for its attainment. The element of novelty in the statement of 
this condition is doubtless mathematically trivial, but may be of use 
in application. Also it is hoped that its direct derivation from the same 
elementary identity as the bound itself is derived, may make it under- 
stood by the nonmathematician—and possibly even by mathemati- 
cians! The usual estimate of the mean of the normal function, whose 
variance is well known to attain the lower bound, is used as an example 
to illustrate how this condition is fulfilled, and the logistic function il- 
lustrates how it is not fulfilled. 

The question dealt with—attaining of the variance of an estimator to 
its lower bound—.s, of itself, of only limited technical interest. Its rela- 
tion to broader questions of estimation, in particular, maximum likeli- 
hood estimation, cannot be dealt with in this short communication. 
Clearly, the fact that an estimator has lower-bound variance is no 
guarantee that the estimator is good, much less that it is “best.” The 
bound may itself be very high and besides, the bias of the estimator 
may be large, so that the mean square error can be enormous compared 
with another estimator whose variance does not attain its much lower- 
bound, and has little or no bias. For the author’s views on more general 
questions of “optimum” estimation, interested readers may wish to 
consult papers [1], [2], [3], [4], [5], [7]. 


ey Y, X are associated random variables, with X coded so that HZ (X)=0, 


we can write for the variance of Y the identity 


V(Y) = Bl — Y) (1) 


~0O61) gf, oon Mh 


V(X) © V(X) 


where E stands for expectation, V stands for variance, C stands for covariance, 
and Y=E(Y). 

Suppose we are dealing with a function ¢ (S, @) =¢, the probability distribu- 
tion of sample S depending on a parameter 0, 7 (S)=T an estimator of 6, and 
d In ¢/d@=D. Since E (D) =0, we may take X = D, and say Y=T. Then 


V(X) = V(D) =V (< ss 
2 ut do )- 


* This investigation was supported in part by a grant from the National Science Foundation. 
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which is Fisher’s amount of information on @ in the sample, and 
C(X, Y) = C(D, T). 
Now, it has been shown that 
C(D, T) =1+0' (2) 


where b is the bias of 7’, that is, E(T)=6+5, and b’ is the derivative of b with 
respect to 6. 
We may then write (1) as 


V(T) = el -1) ~~~ p] +52. 


3 
; (3) 

Since the first term on the right-hand side of (3) is the expectation of the 
square of a variable, it cannot be negative and 


(1 + b’)? 
I ? 
which is the famous lower bound of Fréchet [12], Rao [14], [15], and Cramér 


[9]. 

Consider the first term on the right-hand side of (3). In order that V(7) 
attain the “information limit,” that is, if the equality of (4) is to hold, it is 
necessary and sufficient that the quantity inside the bracket be zero and the 
condition therefore is 


V(T)2 (4) 


#5 1 b’ 
T=T+ . D (5) 


where T= E(T). 

The value of T, b’ and I is independent of the sample, S, while D is a func- 
tion of S; it will be noted that 7 must be a linear function of D. 

As an example, we may take the usual estimate of the mean u of the normal 
distribution whose variance is well known to attain the lower bound: 


1 n 
¢ = ——=— exp | - > (x; — w/20| (6) 
(oV/2r)" i=1 
where the sample is the set of n observations 2%, to, - + + , Xn. 
din} a 
D = —— = — ), (% — ») (7) 


du Co él 
(8) 


If we wish an estimator, 7, for u, which is unbiased and has lower-bound 





! The lower bound for the variance of an estimate has a long history, going back at least to the memoir of 
Pearson and Filon in 1898 [13], including a notation of Edgeworth in 1908 [10], and the classic paper of Fisher in 
1922 [11]. The formula (4) appears to have been first published by Fréchet in 1943, and developed independently 
by Rao and Cramér. 
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variance, then to fulfill (5), since with 7 unbiased, T=, and b’ =0, the condi- 
tion is 


= pt 2S mw (9) 


t= 1 
T = > z./n. (10) 
t=1 


Since E(>>2;/n) =, the estimator, 7, given by (10) is in fact unbiased, and 
it is therefore seen to fulfill the necessary and sufficient condition for attain- 
ment of lower-bound variance. It is interesting that we could have derived 
(10) in this way as an unbiased estimate of the mean which attains the informa- 
tion limit, on the basis of (5) alone, without knowledge of the variance of T or 
its lower bound. 

Now I wish to discuss the case of “bio-assay” with the logistic function, 
which I have dealt with elsewhere [5, 6]. At each of k22 dose levels x, - - - , 
x, animals are subjected to a deathly toxin—n,; animals with dose z,.? 

The probability of death at dose x; is given by the logistic function 

1 1 


P;=1-Q=— = 11) 
Q 1 + e~ (at8r:i) 1 +. enki ( ‘ 





hy ek hoe +8 (12) 
4 n—-$a 4 
Q: 


o=11(") 0 (1) 


sat (1 + em *) 


where L; is the logit of P;; a, 8 are parameters to be estimated; r;=n:—s; is 
the number of deaths observed out of n; exposed at x;; and ¢ is the probability 
of the total set of observations. If 8 is considered to be a known constant, with 
only a to be estimated, we have: 


dl ; 
D « -* ona eof, (14) 


i=] t=] 


i=1 


k 
I = V(D) = & nP AQ. (15) 
t=] 
If an estimator, 7’, of a is to attain its lower bound, we have from (5), since 


T=a+b, 


Li nPQ 
It will be noted that T is a linear function of }-r;, which is a minimal sufficient 
statistic for a (with 8 fixed). 
In my article [5] mentioned, I dealt with a number of estimators, comparing 
them with the maximum likelihood estimate. Among these, for an experiment 


1 + db/da 
A (x rer > nP.). (16) 


t=] j=l 





2 The symbol z used here has a different connotation from that in preceding paragraphs. 
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of three doses n;=10 at each dose, 8 known, a to be estimated, was the Rao- 
Blackwellized minimum logit x? estimator, obtained by applying the Rao- 
Blackwell condensation theorem to the minimum logit x? estimate.’ Regarding 
this estimate, now represented as B, I said that, within the precision of the | 
arithmetic calculations, the mean square errors attained their lower bounds. ~ 
In the present development, noting from (16 )that 7’ must be a linear function 
of >>r;, it occurred to me that I could check the correctness of this conclusion. 
On plotting B against }-r; I found that the relation was not linear; evidently 
there was an error. 

I will recall that, at the time of writing the article, I had only the formula 
(4) for the lower bound, and it was necessary to estimate b’ = db/da graphically. 
Latterly I have realized that 1+db/da can be computed as 


C(T, D) = > Brie; — BY nP; (17) 


where 


B;=B corresponding to )>r;=j, 
r= ri j=0 (1) 30, 

¢;= probability of B;= probability of rj, 
B=E (B), 


the first sum on the right-hand side of (17) being taken over j7=0 (1) 30, the 
second sum over the dosages i= 1, 2, 3. 

In the table is shown, for different a (represented by P.=logit™'a), a com- 
parison of the variance of B with its lower bound, and also p, the correlation 


LOWER BOUND AND ATTAINED VARIANCE RAO-BLACKWELLIZED 
AND MAXIMUM LIKELIHOOD ESTIMATES 








Estimates 





T=B *T = maximum likelihood 





Lower , Lower 
bound V(T) p(T, D) bound V(T) p(T, D) 





. 13095 -13101 .99977 - 15782 .15791 .99970 
. 13449 . 13478 - 99392 - 16335 . 16380 - 99863 
. 14702 - 14815 -99617 . 18371 . 18588 -99416 
- 17109 . 17383 - 99208 -23705 .24611 -98142 
. 18447 . 18780 .99112 - 28830 .30539 -97160 
. 18534 . 18795 .99304 .34544 .37040 -96573 























* Maximum likelihood estimate T = © omitted, that is, only the population of finite estimates is considered. 





3 If there is a sufficient statistic, u, and an estimator 7’ that is not a single-valued function of u, that is, if there 
is more than one value of 7 for particular u, then B=K(T| u), that is, the mean of 7"s corresponding to particular u 
will be a new estimator that has the same bias as 7 but smaller variance, and therefore smaller mean square error 
{8}, [15]. In the present case Er; is a minimal sufficient statistic for a. The minimum logit x? estimate, though it is 
a function of rj, is not a single-valued function and therefore is subject to improvement by Rao-Blackwellization. 
The maximum likelihood estimate is a single-valued function of rj and therefore not subject to Rao-Blackwelliza- 
tion, 
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between B and D, which is unity for an estimate attaining its lower bound. It 
is seen that the variance of B does not attain the information limit, but is close 
to it. 

It was of interest to recompute the lower bound for the other estimates which 
were considered, and in the table is shown also the variance, lower bound, and p 
for the maximum likelihood estimate. It will be noted that the lower bound is 
considerably higher than that of the estimate, B, and that the maximum like- 
lihood estimate itself is further removed from its bound, p being smaller than 
the corresponding value for B. 

I wish to acknowledge indebtedness to Professor Joseph Hodges for ideas 
developed during the writing of a joint article [7], from which the present 
note resulted. 


REFERENCES 


{1] Berkson, J. Discussion of paper by C. R. Rao, “Apparent anomalies and irregularities 
in maximum likelihood estimation,” Bulletin de l’ Institut International de Statistique, 
32 Session, Tokyo, (1960), in press. 

Berkson, J., “Problems recently discussed regarding estimating the logistic curve,” 
Bulletin de UV Institut International de Statistique, 37, No. 3 (1960), 207-11. 

Berkson, J., “Tables for use in estimating the normal distribution function by normit 
analysis. Part Il. Comparison between minimum normit x? estimate and the maxi- 
mum likelihood estimate,” Biometrika, 44 (1957), 420-35. 

Berkson, J., “Estimation by least squares and by maximum likelihood,” Proceedings 
of the Third Berkeley Symposium on Mathematical Statistics and Probability, edited by 
Jerzy Neyman, Berkeley and Los Angeles, University of California Press, 1956, Vol. 
1, pp. 1-13. 

Berkson, J., “Maximum likelihood and minimum x? estimates of the logistic func- 
tion,” Journal of the American Statistical Association, 50 (1955), 130-62. 

Berkson, J., “A statistically precise and relatively simple method of estimating the 
bio-assay with quantal response, based on the logistic function,” Journal of the Amer- 
ican Statistical Association, 48 (1953), 565-99. 

[7] Berkson, J. and Hodges, J. L., Jr., “A minimax estimator for the logistic function,” 
Proceedings of the Fourth Berkeley Symposium on Probability and Statistics, (1960) in 
press. 

[8] Blackwell, David, “Conditional expectation and unbiased sequential estimation,” 
Annals of Mathematical Statistics, 18 (1947), 105-10. 

(9] Cramér, Harald, Mathematical Methods of Statistics. Princeton: Princeton University 
Press, 1946. 

{10} Edgeworth, F. Y., “On the probable errors of frequency constants,” Journal Royal 
Statistical Society, 71 (1908), 662-8. 

[11] Fisher, R. A., “On the mathematical foundations of theoretical statistics,” Philo- 
sophical Transactions of the Royal Society of London, Series A, 222 (1922), 309-68. 

[12] Fréchet, Maurice, “Sur l’extension de certaines evaluations statistiques au cas de 
petits enchantillons,” Revue de l'Institut International de Statistique, 11 (1943), 182- 
205. 

[13] Pearson, Karl and Filon, L. N. G., “Mathematical contributions to the theory of 
evolution. IV. On the probable errors of frequency constants and on the influence of 
random selection on variation and correlation,” Philosophical Transactions of the 
Royal Society of London, Series A, 191 (1898), 229-311. 

[14] Rao, C. Radhakrishna, Advanced Statistical Methods in Biometric Research. New York: 
John Wiley and Sons, Inc., 1952. 

[15] Rao, C. Radhakrishna, “Information and accuracy attainable in the estimation of 
statistical parameters,” Bulletin of the Calcutta Mathematical Society, 37 (1945), 81-91. 





A MODEL FOR MIGRATION ANALYSIS* 


Rape THOMLINSON 
Los Angeles State College 


. Purpose of the Study. . ... 675 
. Construction of the Model 4 shone .. 676 
Computation of Theoretical Frequencies...... . 679 
. Indexes of Migration and Force of Attraction . 680 
. Results of Application 682 
eed Gi Demeeee, 4:5... 5:6is0s sd pie wis AHA eM MTS <Old EON) IEE shel 685 
SRE DE AIEEE OT PO LT. POEL TN ET ROE Fo Bene 685 


NOUPONE 


This paper describes a methodological study attempting to improve 
the quantitative analysis of migration data by controlling for seven 
important spatial variables. A primitive mathematical model is con- 
structed by means of which theoretical migration frequencies are cal- 
culated and then compared with actual frequencies, yielding several 
indexes and a Force of Attraction. This approach was applied to United 
States internal migration for 1935-1940. The highest scores were ob- 
tained for the Pacific area. This model may be used for human, animal, 
or inanimate movements. 


1. PURPOSE OF THE STUDY 


QO F THE three demographic variables, fertility and mortality have much more 


elaborately developed methodologies than has migration. Both the collee- 
tion and analysis of statistics of migration are relatively underdeveloped. In 
particular, the analysis of migration data is the weakest of our demographic 
skills. 

Quantitative explanations of migration have been offered by Ravenstein 
[8], Young [13], Stouffer [10], Stewart [9], Zipf [14], Dodd [3], Kulldorff 
[4], Bogue [2], and other social scientists. Some of these proposals may be 
called mathematical models. 

This article attempts to add another model to the collection of techniques for 
analyzing migration data. It does not attempt to include all sociological and 
other variables relevant to migration. It does attempt to incorporate the most 
significant spatial factors; these factors are those most susceptible to accurate 
measurement. 

The primary objective of the present mathematical model is to enable us to 
compare migration rates by controlling for seven spatial variables, somewhat 
as standardization of vital rates permits us to compare fertility and mortality 
by allowing for the age-sex structure of the population. The seven variables 
are: 1) size of area of origin, 2) size of area of destination, 3) shape of area of 
origin, 4) shape of area of destination, 5) distribution of population within area 
of origin, 6) distribution of population within area of destination, and 7) dis- 
tance moved. The model is designed to compute the number of migrants at- 





* This is a revised version of a paper entitled “A Mathematical Model for Migration,” which was presented 
at the annual meeting of the Population Association of America in Providence, Rhode Island, on April 26, 1959. 
Appreciation is due to Kingsley Davis of the University of California at Berkeley and to Arthur Campbell of the 
Scripps Foundation for Research in Population Problems. 
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tributable to these seven factors. By estimating the effects of spatial variables, 
the model allows demographers to concentrate on the other variables (largely 
sociological and economic) whose effects have been thus isolated. 

Comparability of migration figures is severely limited by our present inability 
to subtract out the effect of these seven variables. The importance of size is 
illustrated by trying to compare migration to contiguous states from Con- 
necticut with that from Pennsylvania: we cannot do it. This difficulty is almost 
universal; very few countries are composed of areas of even approximately 
equal size. Shape, especially compactness, affects the comparison of migration 
streams. The more compact an area, the less likelihood that a move of given 
distance will have its destination outside the area. Variation in population 
density within areas is also a significant factor in the evaluation of migration 
patterns. Lee and associates [5] observe that “a concentration of population 
along a boundary results in greater interstate migration than an even distribu- 
tion,” and then add that “the center of population of most states is eccentrically 
located. Metropolitan centers are frequently located along state lines.” The 
influence of distance has been studied by statisticians and demographers. [6] 

Construction of the model involves a number of series of equidistant con- 
centric circles used to approximate a double-integral formula for the probability 
of migration between any two areas. A set of equations expresses the total 
pattern of migration for distances r, re, 73, - - - , between places ai, dz, a3, - - - 
and places b;, be, bs, - - - . By dividing each area into sub-areas and computing 
theoretical frequencies between every possible pair of sub-areas, this total 
migration pattern may be calculated. The computed frequencies thus generated 
may be compared with the corresponding set of observed frequencies through 
six indexes. In turn, the indexes may be combined into a composite Force of 
Attraction, which measures the importance of non-spatial factors in relation to 
spatial ones. 


2. CONSTRUCTION OF THE MODEL 


Given a circle of radius r; with its center in area A, if part of the perimeter 
of the circle falls in a second area B, then the probability that a person start- 
ing at the center of the circle and moving a distance r will be an inter-area 
migrant is equal to 


length of arc in area B 





circumference of circle 


Averaging this probability for all possible points in A yields the probability 
that any man living in A moving r miles will go to area B (if A has a uniform 
density). Since no area has a uniform density, we must compute a weighted 
average by multiplying the probability at each point times the population at 
that point. 

A theoretical frequency distribution of inter-area migrants from A to B, 
C, D, ete., going distances rj, r2, 73, etc., can be obtained by a set of equations. 
The proportion of persons going each distance may be estimated by the ratio 
N/D, where N represents population and D represents distance. A template 
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inay be used to obtain data for a selected group of points of origin and a se- 
lected set of uniformly spaced distances and probabilities. 

Two short-cuts are necessary to make the study feasible. 1) We must restrict 
the center points to as small a number as will give an acceptable amount of 
detail for area A. 2) We must use a set of distance bands instead of the ex- 
haustive number of distances which are demanded theoretically. For example, 
if A is a state in the United States, each county might be taken as one sub-unit, 
with the point of origin of all migrants from the county placed for computa- 
tional purposes at the center of population of the county and distances of 20, 
40, 60, --- miles. 





1 
! 
a 











Fia. 1. Schematic representation of the model. 


The ratio of actual to expected migrants could indicate the relative desirabil- 
ity to both migrants and non-migrants of given areas of potential destination. 
Either in, out, or net migration can be used. These operations and their inter- 
pretation might be applied to any movement of people, animals, or things 
from one place to another—permanent changes of residence, commuting, 
shopping, vacation travel, shipping of freight, or other situations that form 
a part of human, plant, or animal ecology. 

Let us consider source area A of irregular shape and density D, terminal 
area B of irregular shape, distance of migration r;, point of origin (h, k), and 
are sg of circle intersecting area B (see Figure 1). 

We may now make the conditional probability statement: 
8eDir 


p (migrant settles in B| move of r; miles from point h, k) = . 
=r 
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The probability that any migrant traveling r; miles from a point in A settles 
in B is given by the formula 


[of snonDntty 





P(A, B;r,) = 


d b i 
Qer f D,dxdy 


Then we find 
M(A, B;r,) = [P(A, B;r)]-[M(A; rd], 


where M(A, B; r,) is the total number of migrants from A to B, and M(A; r;) 
is the number of migrants going r; miles from A. 

We are now ready to set up equations expressing the total pattern of migra- 
tion between all sub-areas of A and all sub-areas of B. In order to simplify the 
presentation, let us assume that area A has three sub-areas a, a2, and a3, that 
area B has two sub-areas b,; and be, and that there are four distances: 1, re, rs, 
and ry. Using the preceding formula, we generate the following set of equations: 


M (a,, a) = [P(ax, a1; 171) |[M (ai; ri)| + [P(a, a1; 72) |[M (ai; r2) | + [P(a,, a1;7)| 
[M(a1;1rs)] + [P(ai, a1; rs) ][M (ai; rs) ] 

M(ay, a2) = [P(ay, a2; 171) |[M (ay; 7;)] + [P (ai, a2; 12) |[M (ai; re) ] + [P(qi, a2; 7) ] 
[M (ay; rz)] + [P(ai, a2; rs) |[M (ai; ra) | 

M (as, as) = [P(as, a5; 71) ][M(ai;r1)] + [P(@, as; 72) [M (ai; r2)] + [P(q, a3; 7s) ] 
[M(ay:ra] + [P(q, as; rs) |[M (ai; rs) ] 

M (ay, by) = 

M (ay, be) = 

M (ae, a4) = 

M (ag, az) = 

M (ao, a3) = 

M (az, bs) = 

M (az, bs) = 

M (a3, a,) = 

M (a3, a2) = 

M (a3, a3) = 

M (az, bi) = 

M (az, bs) = 

M(b;, a;) = 

M (bi, a2) = 

M(b;, a3) = 

M (bi, bi) = 
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M (by, bz) = 

M (be, a3) - 

M (be, a2) = 

M (bo, a3) pe 

M (bz, bi) ae kn es . ° ° 

M (bs, bs) = [P(ba, be; 71) |[M (be; 11)] + [Pb2, bo; r2) ][M (62; r2)] + [P (ba, b2; 7) ] 
[M (bo; r3)] + [P(be, be; 74) ][M (bo; 7.) ]. 


Migration between areas is represented as follows: 


M(A, A) = >> > & M(a;, a; 1) = sum of lines 1, 2, 3, 6,7, 8, 11, 12, and 13 


iml jel k=l 
above 


M(A, B) = © & } M(a;, bk; r) = sum of lines 4, 5, 9, 10, 14, and 15 above 


i=l j=l k=1 


M(B, A) = > > & M(b;, a; 7) = sum of lines 16, 17, 18, 21, 22, and 23 above 


i=1 jo] k=l 


M(B, B) = & & } M(b;, hk; r,) = sum of lines 19, 20, 24, and 25 above 


t=] jel k=l 


where 4 = 1, 2,3, 4;j = 1, 2,3; k = 1, 2. 


3. COMPUTATION OF THEORETICAL FREQUENCIES 


In applying this model, we compute a set of expected frequencies of migrants. 
Given two areas, A and B, each having a number of sub-areas, the following 
seven steps are taken: 


1) Determine the centers of the sub-areas. Then use templates containing a 
series of equidistant concentric circles to find 2a below. 
Multiply together 
a) proportion of circumference of circle that falls in sub-area of destina- 

tion (area B) 

b) reciprocal of distance to base sub-area (area A) 
c) population of sub-area of origin (area A) 
d) population of sub-area of destination (area B) 

3) Add together these products (2a times 2b times 2c times 2d). There 
should be one such product for each distance for every sub-area of origin. 

4) After performing steps 2 and 3 for each base sub-area, add these sums of 
products. 

5) Divide this grand sum into the known total number of migrants between 
all of the base sub-areas and all of the other sub-areas, i.e., between area 
A and area B. The resultant parameter k may be used to adjust the 
components of migration from (or to) each sub-area represented by a set 
of products. 

6) Multiply & times each product (as found in 2 above). This yields the 
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expected number of persons who move between each of the base sub- 
areas and each other sub-area. 

7) Compare these expected frequencies with the observed frequencies, thus 
achieving results as promised earlier in this paper, i.e., standardization 
and relative desirability. 


As an example, take Wisconsin as area A and Illinois as area B. A has 71 
county sub-areas and B has 92 county sub-areas. The expected number of 
migrants is computed (steps 1 through 4) between each county of Wisconsin 
and each county of Illinois. The total number of products should be about 71 
times 92 times 4 (assuming that each county, on the average, is intersected by 
four circles), or 26,000. If more than two areas are involved, the number of 
products becomes correspondingly greater. 

Three points require elucidation: choice of sub-areas, distance between 
circles, and location of the centers of the sub-areas. Selection of the sub-area 
unit is usually dictated by existing political subdivisions. If possible, the sub- 
areas should all be about the same size. 

A second decision concerns the optimum increment between successive con- 
centric circles. Efficiency demands that there should be few circles; accuracy 
demands that the number must not be too small. Criteria for the optimum inter- 
circle distance are analyzed elsewhere [12]. 

Third, location of the centers requires centrographic techniques [11, 7]. The 
following method was developed: 


a) Draw seven, nine, eleven, thirteen, or fifteen uniformly-spaced vertical 
lines through the area. 

b) Find the mid-point of each line by bisecting that part of it that falls 
within the area being studied. 

c) Find the “median” vertical mid-point, i.e., that mid-point which divides 
the seven (or more) mid-points so that half are above it and half are 
below it. 

Draw a horizontal line through the “median” vertical mid-point. 

f, g, and h) Repeat these operations for a set of horizontal lines, finishing 
with a vertical line through the “median” horizontal mid-point. 

Locate the intersection of these two lines. This is the center of the area. 


4. INDEXES OF MIGRATION AND FORCE OF ATTRACTION 


Step (7) of the operations for applying the model is a comparison of ex- 
pected and observed frequencies. This comparison is made through a series of 
indexes. For each area we may compute five indexes: 


observed out-migrants , 





Io = index of out-migration = 
computed out-migrants 


observed in-migrants , 





I; = index of in-migration = . . 
computed in-migrants 


obs. net migr. — comp. net migr. 
g P o. 00 





Ty = index of net migration = ; 
computed gross migrants 





Y Bh be observed migrants within area 
= index of local migration = - — “1 
computed migrants within area 
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observed gross migrants , 





Ig = index of gross migration = ~ 
computed gross migrants 


Formulae follow for the case where A is one area and beta is all other areas 
(ie., 8=B,C, D,+-+): 


Observed migrants from A to 
Toa = e A 100 


a > > M (aj, Bx; ri) 


t=1 jl k=l 





Observed migrants from 6 to A 
Iivg''s 100 


DY LL Mh, a;; 73) 
inl jal kel 
Ivy. = [(Obs migr from A to 8) — (Obs migr from 8 to A)] 


i | SEE MG 4579 - SE EMG, a: r)| 


i=—1 jul k=l t=1 jel k=l 








-100 
DD Mai, Bes 7) + DD De M(G:, a3 71) 
i=1 j=l k=1 t=—1 j=l k=l 

Observed migrants from A to A 


p> % M (aj, aj; 7:) 


t—1 j=l 





There appears to be little purpose in calculating I ¢,4. 

We may also compute indexes for every possible move. For example, the 
index of simple migration for the move from A to B may be 127, the index for 
C to A may be 84, and so forth. Since in local migration, the move (e.g., B to B) 
and the area (B) are the same, local indexes for moves are identical with the 
corresponding local indexes for areas. Indexes may be visualized as out-migra- 
tion scores for the sending area or as in-migration to the receiving area—hence 
the term “simple migration.” We can calculate indexes of net migration for 
moves by comparing two differences: a) observed migration in the direction 
of the move minus observed migration opposite to the move, and b) computed 
migration in the direction of the move minus computed migration opposite to 
the move. The index of net migration from B to C will be equal and of opposite 
sign to the net index from C to B. As with areas, an index of gross migration is 
of little interest. In sum, we have the following indexes of moves: 


observed migrants 





index of simple migration = . ; 
computed migrants 


obs net migr — comp net migr ' 





index of net migration = : > 
computed gross migration 


Son migrin (obs migr | ten migrin (comp we? 
dir of move) opp move) 


dir of move) ¥ opp move) 100 





computed gross migration 


I, (index of local migration) is identical with Jz. 
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Lower case subscripts designate moves; upper case subscripts apply to areas. 

Interpretation of the indexes is straightforward. An index of about 100 in- 
dicates that the move or area is average in its desirability. Scores of 200 show 
great desirability. Scores of 50 identify undesirable moves or areas. 

For analysis of moves, greater value inheres in the simple migration index 
than in the net migration index. By contrast, the net migration index has 
greater value for area analysis. Since the indexes measure the importance of 
non-spatial factors (employment opportunities, climate, recreational facilities, 
presence of friends and relatives, etc.) in relation to spatial ones, and since net 
migration is the most important single migration figure for an area, we may 
call Jy the Index of Relative Attraction. 

However, the most important single figure is not always the best measure to 
use. One may construct a composite index number representing two, three, 
or all four of the migration sub-variables, plus, if desired, local migration. 

The preferred composite index is F, the Force of Attraction resulting from 
non-spatial factors in relation to spatial factors. Many combining formulas 
are possible, but the one which seems superior on a priori grounds is 


F =], — Io + 2Iy. 


F is intended only for areas, not for moves. 
A typology may be constructed for values of F: 


250 or above Extreme positive attraction 
150 to 249 High positive attraction 
50 to 149 Slight positive attraction 
— 49 to 49 Neutral attraction 
— 50 to —149 Slight negative attraction 
—150 to —249 High negative attraction 
— 250 or below Extreme negative attraction 


F-values can be transformed to a 0-100 range through solving two simul- 
taneous equations formed by substitution of maximum and minimum values 
in the formula Fy =a+bF. When this is done, the typology must be altered: 


80-100 High positive attraction ++ 
60— 79 Positive attraction 
41— 59 Neutral attraction 
21-— 40 Negative attraction 
0- 20 High negative attraction 


For any universe, the F'; typology necessarily has one area placed in each ex- 
treme category; this is not the case for the F typology. 


5. RESULTS OF APPLICATION 


Application began with an exploratory pre-test using New England as the 
universe, states as areas, and Bogue’s state economic areas [1] as sub-areas. 
Then, for a major trial, this gravitational-type model was applied to the 
United States as the universe, regions as areas, and states as sub-areas, adopt- 
ing a 200-mile interval between circles. Results were obtained for 1935-1940 
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for three different regional groupings of states: the four Census Bureau Regions, 
the nine Census Bureau Divisions, and a set of six regions defined by the writer. 

Three items were needed: a large base map, templates, and basic data. An 
equal-area projection is the best map for this purpose. Templates of transparent 
plastic contained series of concentric circles with the circumference of each 
circle marked off in percentages. Raw data were obtained from Census volumes. 

A 49-section' Working Table was used to generate theoretical migration 
frequencies. Each state in turn was taken as a place of origin, and intersection 
of any circle with another state was measured in percentages by reading the 
template. There were 3,880 intersecting arcs. The nine columns of the table 
were, respectively, 1) source sub-area, 2) base population of source sub-area, 
3) distance in miles, 4) terminal sub-area intersected, 5) per cent of circum- 
ference of circle intersecting terminal sub-area, 6) base population of terminal 
sub-area, 7) distance divided by 10,000 (to eliminate decimals), 8) product of 
items 2, 5, 6, and 7, and 9) k times column 8 or the computed migration. 


OF MOVES FOR FOUR REGIONS, 1935-40 


Residence in 1935 


TABLE 1. SIMPLE INDEXES 





Northeast 
North Central 
South 

West 








Northeast 
78 
20 
39 
240 


== 
| 
| 


North Central 


South 


26 
33 
145 
336 


West 





293 
146 

84 
332 














The figures in column 9 of the Working Table were grouped and added to 
form a 49 by 49 Master Table showing all possible inter-state and intra-state 
moves. From the Master Table, regional groupings of states determined the 
form of three tables (Census Regions, Census Divisions, and author’s six re- 
gions) of regional theoretical migration frequencies. 

Computed migration figures for each regional system were then contrasted 
with observed migration figures, yielding three regional sets of simple and net 
indexes of moves. In addition, the model permits computation of indexes of 
in-, out-, local, and net migration as well as a Force of Attraction for each 
region. 

Simple indexes of moves for the Census Regions are shown in Table 1. 
Grouping the sixteen moves into quartiles, the four moves in the highest 
quartile are North Central to West, South to West, West to West, and North- 
east to West. The lowest quartile consists of moves from Northeast to North 
Central, North Central to Northeast, South to Northeast, and North Central 
to South. 

When simple indexes of moves for the nine Census Divisions are computed, 
two Divisions loom as most desirable: the Pacific and, surprisingly, Nev’ Eng- 
land. 


! The sub-areas condetnd of 48 states plus the District of Columbia. Inclusion of the latter constituted a severe 
challenge to the model by considerably increasing the disparity in size between the largest and smallest sub-areas. 
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Indexes of net migration are presented in Table 2. Note the symmetry of the 
table. The largest positive and negative figures here are North Central to 
West and, of course, West to North Central. 

In the Division table, the extreme values are West North Central to Pacific 
and Pacific to West North Central, respectively. 

Table 3 summarizes the indexes of areas and Force of Attraction for the 
four Census Regions; Table 4 contains corresponding scores for the nine 
Census Divisions. The contents of these tables encourage a belief that the 
westward movement of population in the United States is based on a very high 
non-spatial attractiveness of the Pacific states. 


TABLE 2. INDEX OF NET MIGRATION OF MOVES FOR 
FOUR REGIONS, 1935-40 








Residence in 1935 
Residence in 1940 





Northeast North Central South 





Northeast + 7 — 10 
North Central + 4 
South -— 4 
West +121 

















TABLE 3. INDEXES OF MIGRATION AND FORCE OF ATTRACTION 
FOR FOUR REGIONS, 1935-40 


TI; Io Ty F 











Northeast 27 35 78 —- 14 
North Central 33 55 125 — 44 
South 35 48 145 — 26 
West 
































TABLE 4. INDEXES OF MIGRATION AND FORCE OF ATTRACTION 
FOR NINE DIVISIONS, 1935-40 
Area It Iy | F | Type Type 
New England ° 55 | +200 ++ 
Middle Atlantic 74 — 60 
E. N. Central + 10 
W.N. Central 273 5 | —101 
South Atlantic 285 + 83 
E. 8. Central 298 —- 10 
W. 8. Central 129 
Mountain 5 788 + 66 
Pacific 290 









































* Not meaningful. 
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6. EVALUATION AND SUGGESTIONS 


Several assumptions require consideration. First is the assumption that 
migration varies inversely with the first power of distance. Although there is 
evidence for a linear relationship, the best value of the exponent is not estab- 
lished. The basic reasoning of this model is independent of the values assigned 
to constants and parameters; necessarily, any changes in their magnitude will 
result in different calculated frequencies, but such changes will not invalidate 
the basic premises. 

There is also an assumption of equal probability of movement in any direc- 
tion, thus ignoring possible directional differences. Instead of being a source 
of bias, directional differences seem to be another possibly relevant variable 
whose influence should remain uncontrolled in order to affect the Force of 
Attraction. 

The assumption that a finite number of sub-areas and circles will suffice to 
approximate the integral is a potential weakness. However, it is a weakness of 
the genre rather than of this particular model. Indication of the adequacy of 
an inter-circle increment of 200 miles is provided by the number of blank cells 
in the Working Table. On this criterion, all indexes in Tables 1~4 are satisfac- 
tory except local New England. 

Fourth, the model assumes a closed system; i.e., every possible move has an 
origin and destination within the given universe. Since the above application 
involves internal migration only, the assumption is not realistic. The serious- 
ness of this obstacle cannot be determined wit: .-ut application of the model to 
at least an international and preferably to a world situation. 

Fifth, the model neglects the vertical dimension. Distortion may be intro- 
duced by using a flat map to compute expected frequencies, whereas actual 
surface movement takes place over a three-dimensional mountainous reality; 
but since all migration data are collected on a two-dimensional basis, the 
present analysis (or any other) must remain two-dimensional. 

Sixth, the model ignores travel routes and transport time. There would be 
virtue in measuring all distances over travel routes instead of using the shortest 
straight lines. But which means of travel: automobile, railroad, bus, airplane, 
or boat? Which route: the fastest, the cheapest, the most convenient, the most 
reliable, or the most used? Unfortunately, all such answers would be bound to 
one date because transportation prices, speed, and routes change. The shortest 
straight line distance is fixed—and much easier to measure. These variables 
are a part of the “unexplained” influences that help to produce migrations not 
“explained” by this model. 

In conclusion, the model offers a potential contribution to our ability to 
understand human, animal, and inanimate migratory activity of any kind. The 
model does not attempt to predict, but rather tries to determine the effect of 
certain measurable spatial factors to permit closer investigation of non-spatial 
factors. 
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A NOTE ON THE ASYMPTOTIC NORMALITY OF THE 
MANN-WHITNEY-WILCOXON STATISTIC 


Jack Capon 
Federal Scientific Corporation 


The Mann-Whitney-Wilcoxon statistic is of some importance in non- 
parametric statistical inference. The asymptotic normality of this sta- 
tistic has been established previously for the case where the distributions 
of the random variables, in each of the two samples, are the same. In 
the present work it is shown that this statistic is asymptotically normal 
when these distributions are not all the same. An application of this 
result to a particular hypothesis testing problem is also given. 


1. INTRODUCTION 


E ARE given independent random samples X,, ---, X, and Y;,---, Y, 

from populations with continuous cumulative distribution functions 
Fxy,=F;, t=1,---, mand Fy,=G;, j7=1, - - +, n. The Mann-Whitney-Wil- 
coxon statistic [4], [5] may be defined as* 


Vas = ff Fads, (1) 


where F,, and G, are the sample cumulative distribution functions of the X’s 
and Y’s, respectively. This test statistic is of some importance in nonparametric 
statistical inference. 

The asymptotic normality of V,,, was established by Lehmann [2], using re- 
sults due to Hoeffding [1], for the case F;=F, i=1, - - - , m,G;=G,j=1---, 
n. In the present work we will show that V,,,, is asymptotically normal when the 
F’s and G’s are not all the same. The method to be used in the proof is similar to 
that used by Hoeffding [1] in showing the asymptotic normality of a class of 
random variables known as U statistics. 


2. MAIN RESULT 
We will now prove the following theorem. 
Theorem 


If m and n—~, and if 


m n 
Yat ry 
1 


j=l 


limit ———_—_—— -=0, 


m=t @ m Ps n o\ 3/2 
n>« (Lei+ Yi) 


i=] j=l 


o: = nmE[G(X,) — EG(X.)]* 
¢; nE|F(Y;) — E(F( YI", 


* All limits of integration are minus and plus infinity. 


687 








AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 
p: = n*/m-*E| G(X,) — E(G(Xy)I", (5) 

pj = n-E| F(Y;) - E(F(Y))I, (6) 
F(x) =m- >> F(z), (7) 


t=1 


Ge) = n- DG,(z), (8) 


j=l 


then n'/?(Vinn—E(Vn.)) has asymptotically a normal distribution with mean 
zero and variance 


mn var Vinn 


= mf FdG+ nf G dF a f F(1 — F)dG + (mn) > 2 (f ear.) 


i=l j=l 


(9) 


— mn > f (FdG;) — nm > | GaP)’ — nd | GaP, 
j=l 


t=1 j=l 


where 
E(V mn) = { Fac 


provided n var Vin, +0. 
Proof: 
Let 
Amn = 1'!?(Vinn — E(Vimn)); (11) 


Ban = n-¥? > [F(Y,) — E(F(Y))] — mm ¥ [G(X) — EG(X))]. 12) 


j=l t=1 


The method of proof is to show that the random variable A,,, converges in prob- 
ability to the random variable B,,,. In order to establish convergence in prob- 
ability it is sufficient to prove that A,,, converges in mean square to B,,.,. It 
will be straightforward to show that B,,, has asymptotically a normal distribu- 
tion by simply applying the central limit theorem for sums of independent and 
non-identically distributed random variables. Thus, we will have established 
that A,,, converges in probability to a normally distributed random variable, 
which, in turn, will imply that A,,,, has asymptotically a normal distribution. 
We begin the proof by showing that B,,, has asymptotically a normal dis- 
tribution. As mentioned previously, this is easily done by using the central 
limit theorem for sums of independent and non-identically distributed random 
variables. The application of this theorem yields the result that a sufficient con- 
dition for the asymptotic normality of Ba» is that given in (2), [3, pp. 275-7]. 
We may write 
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Vinn — E(Vinn) = J Pade, - [ Fat 


i f (Fn — F)d(G. — G) + f FaG, —G) + f (F. — Fda. 


However, we have that 


f Fac. -G)= f Fac, = e(f FaG.) =n v KY) — E(F(Y;)), (14) 


fe — F)dG = —- f Gd(F,, — F) = — m“ > G(X,) — E(G(X))). (15) 
The sum of the two terms in (14), (15) is n~"/?B,,,. Therefore 
n—!2(Aun — Ban) = f (Fm — F)d(G, — G) 
= (mn) > f (hs — Poa, - @) 


fn fant 


P(x) = 1, x > Xi, 


1 
0, otherwise, 
1 


G(x) = io S > Y;, 
= otherwise. 


It is possible to rewrite the summand in (16) as 
f (F; — F)d(G,; — G) = FY) — FAY) + G(X) —- f GAF,, 
so that 
B( f (F; — Fia(G; - G)) 
= B(var(P(Y)| YD) — 2BR(Y) — FAY) (G(X) — f GAR )XD 


+ var G;(X;) 
= E(var(F(¥;)| Y;)) — var G,(X,), 


since 
E((P AY) — FY;))|Xi) = 1 — G(X) - f rac, = f car, — G;(X,). (21) 


The Bernoulli random variable F;(Y,;) has a variance which is less than or 
equal to } for all Y;, so that we obtain from (16) 


E(Amn — Bmn)* < 1/4m. (22) 
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Therefore, we have that 


limit E(Amn — Bn) = 0, (23) 
which shows that A,,, converges in mean square to B,,,,. The variance of Vn» is 
obtained from (13), (14), (15), (20), and is easily seen to be in agreement with 
(9). This completes the proof of the theorem. 

The following corollary follows immediately from the theorem by showing 
that (2) is satisfied when F;=F,i=1, ---,mandG;=G,j=1,---,n. 
Corollary (Lehmann [2}) 
If m and n—~, such that 
limit n/m < ~, Be +. * ¢ ae 


m—> @ 


n> 2 


have a common continuous cumulative distribution function Ff, Yi, ---, Yn 
have a common continuous cumulative distribution function G, then n'/?(Vinn 
—E(Vmn)) has asymptotically a normal distribution with mean zero and vari- 
ance 
mn var Ving = (m+ n+ 1)/12 + (m — 1I)(A — &:) + (n — IA — a) (24) 
— (m+n — 1), 


where 


- f Fae, 
“ f F'dG, 


provided n var Vi», ~0. 
Proof: 
The quantity in (2) is less than or equal to the following 


(n/m)?/2(m-1/2) + n-1/2 


i re es a 


an («n im) ( f car-( f Gar) ) + ( f F dG— (J fraa)’))" 


Using the Schwarz inequality we obtain that 


( f car) < f GdF, 
( f rag) < f F'dG, 
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with equality if and only if G=K, F=K’',0<K, K’<1. If we exclude these de- 
generate distributions, then the denominator of (29) is positive, and (2) is satis- 
fied. This completes the proof of the corollary. 


3. APPLICATION 


It is of interest to give an example where our results apply. They are applica- 
ble for the problem of testing 


Ho: Fx, = Fy;,, t=1,- 
against 
H: Fy, = F,, t= 
Fy,=G;, j 


The limiting distribution of V,,, under Ho, H, may be obtained by using the 
corollary, theorem, respectively, provided, of course, that all regularity condi- 
tions are satisfied. 
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ESTIMATION OF LOCATION AND SCALE PARAMETERS IN A 
TRUNCATED GROUPED SECH SQUARE DISTRIBUTION 


P. R. Fisk 
University of Aberdeen 


The iterative calculations required to obtain the maximum likelihood 
estimates of a and @ in the distribution function dF (z) =e*/(1+e*)*dz, 
where z=a-+z, are presented for doubly truncated, upper and lower 
truncated, and untruncated grouped samples. The calculations needed 
to obtain the maximum likelihood estimator of Pareto’s constant are 
also given for a lower truncated grouped sample with both equal and 
unequal grouping. Asymptotic variances and covariances are derived 
for all estimators considered. 


1. INTRODUCTION 


F: a previous paper [7] the author suggested the use of the sech? distribution, 


iad 
dz, —-ao<r<o 


dF(x) = ————— dz 
(1 4 e***)? 


for the graduation of income distributions. The same distribution has been 
suggested, for example by Cox [5], as an alternative model to the log normal in 
studies of the distribution of life times. This function also occurs as the rate of 
infection in an epidemic model with zero death and immigration rates [3]. In 
all these cases estimates of the parameters could be obtained by a quasi jeast 
square method, used previously by the author, from the linear relationship 


In {F(x)/1 — F(z)} =a+frrt+e (1) 


where the ¢ are random error terms. This is clearly not a satisfactory method 
of estimation because the requirement of independent errors with constant 
variance is not satisfied. A more satisfactory linear method, when the data are 
grouped and untruncated, is that suggested by Hammersley and Morton [8]. 
Alternatively, when the data are ungrouped but censored the linear estimators 
suggested by Plackett [9] could be used. Both sets of estimators are best linear 
unbiassed estimators in the sense of Aitken [1]. Plackett’s linear estimators re- 
quire the calculation of the expected values of ordered logistic variables which 
could, for a sample of any appreciable size, make the computations tedious. The 
purpose of this paper is to present formulae for the calculation of maximum 
likelihood estimates of scale and location parameters for this distribution. 
The distributions of income and life times are usually truncated or censored; 
the former because a tax exemption limit usually exists below which income re- 
cipients are not obliged to make a tax return, the latter because the tests have 
to terminate within a finite time when some of the test items have not yet 
failed. The data in both cases are either too bulky to be used without grouping, 
or can only be obtained in grouped form. The formulae given here will, there- 
fore, relate to a sample given in grouped form which is truncated at both ex- 
tremes. The censored sample is obviously a special case of a grouped distribu- 
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tion. The adjustments to these formulae for single truncation to the right or 
left and for grouped untruncated data are given in a separate table. The formu- 
lae required to obtain the maximum likelihood estimate of the single parameter 
in the Pareto distribution are also given. 


2. MAXIMUM LIKELIHOOD ESTIMATING EQUATIONS 


We may suppose that a sample of size n is taken from a population with dis- 
tribution function defined by 


stacatnig! | 
(1 + e*)? (2) 
zZ a+ Bx —-~o<2r< w 


dF (x) 


where a and # are parameters of location and scale, respectively. The only in- 
formation available from the sample is the number of observations falling be- 
tween consecutive values of a fixed partitioning of the range of x. This parti- 
tioning will be defined by a fixed set of values 2, 2, - - - , 2%, where the dif- 
ference between consecutive values need not be constant. We define x_; and 
T%41 as lying at — © and + ~, respectively. Denote the number of observations 
falling in the 7th group, with end points (2;1, 2;), by n,, and the total number 
of observations by 
k+1 


n= > n. 


t=0 


For the doubly truncated distribution no =m4,=0. We define P; as 


[var@ = e*4/] + ei, (3) 


If truncation exists at both extremities we may suppose that the sample was 
drawn from the total population with a probability of an observation falling in 
the ith group 

;=0 i=0,k+1 
P; jet Py ‘ 
= a ar 2 
P, — Po 


The likelihood of obtaining the sample observed is simply 


. 
nif] 


int (nj)! 


from which the maximum likelihood equations can easily be obtained. In 
deriving the likelihood equations we have written the probabilities, x,, as 


mn 


(1 + e*)(1 + e*) sinh 7; ' 
= — —_——_—_——— -———- X @) ot ti — Te — 2%)/2 
(i +e)(1 +e%) sinhy exp (Ai + %1-%—2 )/2} 


un 
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mi = (2; — 2:-1)/2 
n = (z — 20)/2. 


It is obvious, from the form of z;, that linear likelihood equations in the un- 
known parameters cannot be obtained, thus the iterative method of solution, 
known as the method of scoring, has been employed. An adequate explanation 
is given in Rao [10, p. 165]. The calculations required are given in the form of 
a computing recipe, obtained from a simple linear transformation of the normal 
equations. 

1. Obtain initial guesses of the parameters a and 8. This can most easily be 
done by assuming the distribution to be untruncated and using the quasi least 
squares method described in the introduction, or by plotting In { F(x)/1—F(z) } 
against x on graph paper and guessing the value of slope and intercept. If the 
group intervals are equal then a simple method of obtaining initial values is 
given by Amoroso [2]. 

2. With these initial values calculate z; for each of the values of x; at the 
group endings. The corresponding values of P; can be obtained from Fisher and 
Yates [6, Table VII or Table XI]. 

3. Using these values of z; and P; calculate 


iz 
Si Po + P, = a y (n; + Nip wP; 


Nn ind 


i 
= z0(Po -_ 3) + zn (Px ‘ses +) elias 2, (n; + Nig ve(Ps cad 3) 


n t=0 
] 


P 
z. nm; ctnh n; — 9 ctnh 9 
nN jini 


k 


ps wiP Q: 


i=0 


P 
Zz w ziPQ; 


i=0 


= > w2z?P Q; 


t=(0 


= > # ne esch? n; — 7? esch? n 


t=1 


where 

P;-— Pir 
Pi— Po 
= Fit Fini 
=-(1-#) 
=-(1- %) 


Q=1- P; 
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4. Using these sums calculate 
D = SS. + 87) — S83? 
A = 8,(S2 + S83) — SiS; 
B = 8,(S¢ + Sz) — Ss(S2 + Ss) 


5. Adjusted values of the “estimates” may be obtained from 


ai (1+<)+5 
“taht y D/" D 


a= ai(1 + 
(1+) 


6.. With these adjusted values of a and 8 recalculate steps 2 to 5. This process 
will continue until the differences between successive values of the “estimates” 
are less than some pre-assigned values. 

7. At the last stage of iteration the asymptotic variances and covariance of 
& and @ are given by 

var (&) = (S¢ a S7 o- 2S; oa &'8,)/nD 
var (8) = B%S,/nD 
cov (@, 8) = B(@S, — Ss)/nD 

The computing recipes for other types of truncated sample are the same as 
those given above with appropriate alterations to the sums. The alterations re- 
quired for the singly truncated and the grouped untruncated sample are given 


in Table 1. The estimating equations for the ungrouped untruncated sample 
are as follows. With initial values of a and 8 calculate 


) n 
1-—- > P; 


N jm 
° n 
1 + z—- — Zz z,P; 
N jm} 


where the z; and P,; are as defined by (2) and (3), and Z is the simple arithmetic 
mean of the z;. The adjusted values of a and # are then obtained from 


ay = ao[1 + 9S2/(x? + 3)] + 38, 
8B: = Boll + 9S2/(x? + 3)]. 
The process will continue until a predetermined degree of computational ac- 


curacy has been attained. At the final stage of the iterative process the asymp- 
totic variances and covariance of @ and § will be given by 


3 
var (a) = z [l + 382/(e? + 3)] 


var (8) = 982/n(x? + 3) 
cov (43) = 943/n(x? + 3). 
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3. MAXIMUM LIKELIHOOD ESTIMATION FOR THE PARETO DISTRIBUTION 


It was shown in a previous paper [7] that the sech? distribution is asymp- 
totically equivalent to a form of the Pareto distribution, 


dG(t) = At-®Ydt 


at extreme values of the argument; where z, in the equations of the last section, 
is the logarithm of income, t. Most interest lies in the Pareto constant, 8, which 
provides a measure of the degree of inequality of income distribution. This 
constant is frequently estimated by a quasi least squares method from the 
equation 

log y = C — Blogt (4) 


where 1—y=/{dG(z), t is income, and C is a constant which depends on both 
Pareto’s constant, 8, and the lowest value of the income range, u. The value of 
u is rarely known and of little intrinsic interest. It might be expected that any 
error terms in (4) will be highly intercorrelated, thus one would not expect the 
quasi least squares estimator of 8 to be the most efficient. If we assume that the 
value u is less than the lowest observed income we may calculate the maximum 
likelihood estimate for a truncated Pareto distribution on analogous lines to 
those given in the previous section. It is assumed that the data are given in 
grouped form. 

The probability that an observation, drawn at random from the grouped 


truncated Pareto distribution, should fall in the 7th group is 
r= (e~Bri-1 ~— e8zi) eBzo i= 1, 2, oo k +] 
(5) 
= 0 i=0 


where z is the logarithm of income. In deriving the maximum likelihood equa- 
tions we have preferred to write 7; as 


x; = 2exp [xo — 3(x; + 2i-1)] X sinh 9; 
Ti41 = exp B(xo — 2x) 
The computing recipe is: 
1. Obtain an initial value of 8, possibly by the quasi least squares method, 
or by the use of graph paper. 


2. With this value of 8 calculate #; using (5). 
3. Calculate 


1 k 
-— 5 4 p de (nj + Nignti + masta 


n 


i=O 


Hh 
“~~ nm etnh n; 
NB imi 


1 k 
eg Ae 2 2 
S; 3 yp #m? esch 2n,; 
where 
ni = Blas — Xi-1)/2. 
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4. Obtain a second approximation to the maximum likelihood “estimate” 
of 8 from 


Sit Ss 


Bi = Bot+ 3, 


5. With this value of 8 repeat steps 2, 3, and 4 until the difference between 
successive values of 8 are less than some preassigned level. It is obvious that S; 
remains a constant throughout the iteration. 

6. The asymptotic variance of § at the last stage of iteration will be given by 


var (8) = (nS;)-}. 
If the values of x; (¢=0, 1, -- - , k) are taken at equal intervals; that is, if 


the income group endings are in geometric progression; the maximum likeli- 
hood estimator of 6 will be given by 


A 


2 
6B = — — etnh™ (S,/K) 
v 


K = =(1 ae =) 
2 n 


and v is the common group interval. The asymptotic variance of this estimator 
is 
var (8) = 4[nv?(1 — #241) esch? n]- 
where 
n = Bv/2. 
4. APPLICATION TO THE DISTRIBUTION OF INCOMES IN U.S.A., 1918 
In the previous paper [7] it was suggested that a total income distribution 
might be graduated using the sech square distribution with 
20> 5;(ay + By In t) 4. (1 om 5;) (ae ob Bo In t) (6) 
where 
1 t<t; 
= 0 t> t. 
Reference should be made to this paper for a discussion on the relative merits 
of this distribution and the alternative log normal distribution. This form of 
the sech square distribution was fitted to the income distributions given in 
Champernowne [4, Tables II and IX], and the example given here refers to the 
distribution of incomes in the U.S.A. for 1918 as presented in Champernowne’s 
paper. 


If the maximum likelihood method is used, the model given by (6) would re- 
quire that the two parts of the distribution, above and below ¢;, be treated 
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separately. The exact value of t; need not be known. Any value in its immediate 
vicinity would serve to divide the total distribution without altering the 
values of the maximum likelihood estimates greatly. In this example it is 
assumed that the point of division is at $3100 which is the group ending ap- 
parently nearest to the point of division. The distribution above this value fol- 
lows very closely the Pareto distribution, thus this approximation to the sech? 
distribution was fitted. 

This example is intended to illustrate the use of the formulae given above. It 
is not suggested that all total income distributions will necessarily permit the 
treatment given here to be used. As was stated in the previous paper, it may 
prove more fruitful to analyze changes in the homogeneous distributions which 
cumulate to create the total income distribution. Because of the nature of much 
data on income distributions, particularly that obtained from tax records, 
truncation and grouping are likely to be found in these homogeneous distribu- 
tions also. 

The maximum likelihood equations for the two sections of the total distribu- 
tion separate out quite easily. Initial values of the parameters were obtained by 
the quasi least squares method (1) and were those used to calculate the gradu- 
ated distributions in the previous paper [7, Table 2]. The first stages of the 
iteration are given in Tables 2 and 3. The number of figures carried in each of 
the columns of Table 2 is sufficient to give the values of the parameters correct 
to three figures. In the later iterations an extra decimal place was given to P; 
and rounding was cut to the minimum. This provided at least five-figure accu- 
racy. Similar remarks apply to Table 3. The values of the parameters at the 
different stages of iteration are 


ay By 


Initial value — 23.844 3.3840 
stage 1 — 25.286 3.5997 
stage 2 — 25.354 3.6095 
stage 3 —25.353 3.6094 


with estimated asymptotic variances of the final estimates, 
var (@:) = 0.027854 = (0.1669)? 
var (8;) = 0.000564 = (0.0237)? 
var (2) = 0.001712 = (0.0414)*. 


The small values of the sampling variances obtained here are to be expected 
with samples of such large size, It might be supposed that any bias in the 
maximum likelihood estimates will also be small for the same reason. Thus, the 
difference between the initial, quasi least squares, estimate and the maximum 
likelihood estimate gives some indication of the degree of bias of the former. 

The two sections of the distribution were tested separately for goodness of 
fit. The values of x? for each section, both with nine degrees of freedom, was 
11.4 for the first section and 8.8 for the tail—neither of which values indicates a 
significant deviation of the data from the expected values. The actual and 
graduated distributions are set out in Table 4. It will be noted that the per- 
centage difference between actual and expected values is much less than those 
given in the previous paper [7, Table 2] in most groups below $25,000, but 
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TABLE 2. DISTRIBUTION OF INCOMES IN U.S.A.; 1918 
($400-$3 , 100) 


doubly prapetiied ganget sech? distribution 








P; 


. nitn 
Group + 3) 


Income A 
$00’s (000’s) (000’s) ca 





0 2,512. 
2,512.0 | 4,666. 
2,154.0 | 4,822. 
2,668.0 | 5,681 
9 | 3,013.0 | 6,157. 

9-10 | 3,144.7 | 6,219 
10-11 | 3,074.4 | 5,924 
11-12 | 2,850.5 | 5,385. 
12-13 | 2,535.3 | 4,741. 
13-14 | 2,205.7 | 9,260. 
14-20 | 7,054.3 |10,262. 
20 31 | 3,208.3 | 3,208. 
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0.02665 .09514 3.29404 — — 
0.09000 -19773 | —0.30516 .6786 . 1524 
0.13290 .22261 | —0.22763 .9180 .0226 
0.17580 .21483 | —0.19088 .4903 .0171 
0.21178 .17493 | —0.13869 .1163 .0130 
. 23684 .11060 | —90.07911 .6308 -0107 


1 
—8,076. 1 
1 
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. 24868 .03631 | —0.02362 .2846 | 1.0086 
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6,204 
14-20 | 19,273 
20-31 | 10,779. 


0 

0 

0.24862 .03704 0.02216 .8288 -0072 
0.23935 . 10029 0.05548 .4524 .0061 
0.22392 . 15003 0.18854 .0100 .0053 
0.11502 .21601 0.60979 
0.03247 -10910 | —3.00586 
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n =3.465 ctnh » = 1.0020 n ctnh » =3.4719 n® csch? » = 0.04786 

S,= 0.02088 D =0.173932 

S:= 2.44957 A =0.011086 

S;, = —2.41298 B =0.013469 

S,= 0.289317 

Ss = —0.023927 second “estimate” of a; = —23.844 X 1.06374 +0.0774 = — 25.2864 
S_= —0.289055 second “estimate” of 8:=  3.384X1.06374 = 3.5997 
S;= 0.892216 





PARAMETERS IN A TRUNCATED GROUPED SECH SQUARE DISTRIBUTION 


TABLE 3. DISTRIBUTION OF INCOMES IN U.S.A.; 1918 
($3,100 and above) 


grouped truncated Pareto distribution 
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S; = —0.609184 
S:= 0.664958 
Ss= 0.35027 
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DISTRIBUTION OF INCOMES IN U.S.A.: 1918 


actual and fitted distribution 


TABLE 4. 





Income Group Actual distribution 


($00’s) 
0-4 
4-6 
6-7 
7-8 
8-9 
9-10 
10-11 
11-12 
12-13 
13-14 
14-20 
20-31 





31-50 
50-100 
100-250 
250-500 
500—1 ,000 
1 ,000—2 ,000 
2, 000-4 ,000 
4 ,000-—5 ,000 
5 ,000-10 ,000 


1,240.4 


(000’s) 
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587.8 
192.1 
41.12 
14.01 
4.945 
1.728 
0.248 
0.369 





2,667. 
2,980. 
3,148 .§ 
3,046. 
2,830. 
2,514. 
2,168. 
7,257 
3,106. 


~ 
= 


om bh or 


oO we 


1,201. 
628. 
204. 

34.72 
9.99 
2.872 
0.825 
0.110 
0.158 


Orem WS | 


10,000 + 0.152 0.065 


above that income level the percentage differences become large. This is in- 
evitable because of the smaller weight given to the extreme incomes in these 
formulae. 
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THE PROGRESS OF THE SCORE DURING A BASEBALL GAME 


G. R. LinpsEy 
Defence Systems Analysis Group, Defence Research Board, Ottawa, Canada 


Since a baseball game consists of a sequence of half-innings com- 
mencing in an identical manner, one is tempted to suppose that the 
progress of the score throughout a game would be well simulated by a 
sequence of random drawings from a single distribution of half-inning 
scores. On the other hand, the instincts of a baseball fan are offended at 
so simple a suggestion. The hypothesis is examined by detailed analysis 
of 782 professional games, and a supplementary analysis of a further 
1000 games. It is shown that the scoring does vary significantly in the 
early innings, so that the same distribution cannot be used for each 
inning. But, with a few exceptions, total scores, establishments of leads, 
overcoming of leads, and duration of extra-inning games as observed in 
the actual games show good agreement with theoretical calculations 
based on random sampling of the half-inning scoring distributions ob- 
served. 


1, INTRODUCTION 


BASEBALL game consists of a sequence of half-innings, all commencing in 
A an identical manner. It might therefore be supposed that a good ap- 
proximation to the result of a large sample of real games could be obtained 
from a mathematical model consisting of random drawings from a single popu- 
lation of half-inning scores. To be more explicit, two simple assumptions can 
be postulated, which could be labelled “homogeneity” and “independence.” 

“Homogeneity” implies that each half-inning of a game offers the same a 
priori probability of scoring—i.e. the distributions of runs for each half-inning 
are identical. “Independence” implies that the distribution of runs scored sub- 
sequent to the completion of any half-inning is unaffected by the scoring history 
of the game previous to that time. 

On the other hand, there are reasons to suggest that the scoring in a game 
may have a structure more complicated than that represented by a simple 
superposition of identical independent innings. Pitchers may tire and lose 
their dominance over batters as the game progresses. Once a lead is estab- 
lished the tactics may alter as the leading team tries to maintain its lead and 
the trailing team tries to overcome it. Reminiscences suggest that special 
events occur in the last half of the ninth inning. The home crowd never fails to 
remind the Goddess of Fortune of the arrival of the Lucky Seventh. Surely a 
game so replete with lore and strategy must be governed by laws deeper than 
those of Blind Chance! 

This paper attempts to examine such questions. 

The main body of observed data are taken from the results of 782 games 
played in the National, American, and International Leagues during the last 
three months of the 1958 season. All games reported during this period were re- 
corded. 

The distributions of runs are found for each inning, and compared with one 
another to test the postulate of homogeneity. Using the observed distribution 
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and assuming them to be independent, the theoretical probability distributions 
of a number of variables such as winning margin and frequency of extra innings 
are calculated. These theoretical distributions are then compared with those 
actually observed in the sample of 782 games, in order to test the postulate of 
independence. 

Some of the unexpected results are examined in a further sample of 1000 
games chosen at random from the 1959 season of the National and American 
Leagues, and some of the conventional tests for correlation are applied. 


2. DISTRIBUTION OF SCORES BY INNINGS 


The distribution of scoring of runs in each half-inning is shown in Table 1. 
The columns headed 0, 1, - - - >5 show the relative frequency with which one 
team scored z runs in the ith inning. The column headed “N,” shows the num- 
ber of ith half-innings recorded. Games abandoned due to weather before be- 
coming legal contests, or abandoned with the score tied, were not counted, so 
that the 782 games necessarily produced 1564 first, second, third and fourth 
half-innings. A small number of games were called in the fifth, sixth, seventh 
or eighth inning. Many games did not require the second half of the ninth in- 
ning, and only afew required extra innings. International League games sched- 
uled for seven innings only were excluded. 

The third last column shows the mean number of runs scored by one team 
in the 7th inning, and the second-last column gives the standard deviation ¢ 
of the distribution. The last column, headed ¢/+/N;,, gives the standard error 
of the mean. It is immediately evident that the means differ considerably and 


significantly from inning to inning. 

The means and standard deviations are also shown on the top half of Figure 1, 
in which the solid vertical black bars are centered at the mean number of runs, 
and have length 20//N;,. 


TABLE 1. RELATIVE FREQUENCY WITH WHICH ONE TEAM SCORED 
z RUNS IN THE iTH INNING—(BASED ON 782 GAMES FROM 
NL, AL, IL, 1958) 
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The bottom row of Table 1 shows the overall frequency distribution when all 
innings are combined. If the numbers in this bottom row are combined with N; 
to give the frequencies expected in the other rows for the individual innings, on 
the assumption that all innings are merely samples of the same aggregate, and 
all extra innings are combined into one row, then the differences between the 
number of runs observed and expected produce a value of chi-square which has 
a probability of only about 0.005 of being exceeded by chance alone. 





MEAN NO. OF RUNS PER HALF INNING 
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 . 
= 
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a 
a 
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PROB. OF SCORING MORE THAN THREE 











12345 67890€E 
INNING 


Fie. 1. Scoring by individual half-innings. 
Black: 782 games of 1958 
White: 1000 games of 1959 


Thus it can be concluded that the distribution of runs is not the same from 
inning to inning, and the “postulate of homogeneity” is untrue. Examination 
of Table 1 or Figure 1 shows that the mean number of runs are greatest in the 
first, third and sixth innings, and least in the second and in the extra innings. 
There is nothing remarkable about the seventh or the ninth innings, both of 
which are very similar to the aggregate distribution. 

The most notable differences occur in the first thee innings, where the mean 
scores in the first and third innings are 9.53, while the mean score in the second 
inning is only 0.38. The probabilities of deviations as large as these from the 
overall mean of 0.475 are approximately 0.03, 0.001 and 0.03 for a homogeneous 
sample. Deviations of the means for all later innings, and chi-square for the 
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later innings, are small enough to allow them to be considered as samples from 
the overall aggregate (at a level of e=.05). 

A possible explanation for this peculiar pattern of scoring in the first three 
innings is that the batting order is designed to produce maximum effectiveness 
in the first inning. The weak tail of the order tends to come up in the second 
inning, the strong head tends to reappear in the third, and the weak tail in the 
fourth. 

This pattern is even more evident if one plots the frequency of scoring three 
or more runs, obtained by adding the columns for z=3, 4, 5, and >5 runs. The 
frequencies are shown by the solid black circles on the lower half of Figure 1. It 
is seen that the third inning is the most likely to produce a big score. 


TABLE 2. THE RELATIVE FREQUENCY WITH WHICH ONE TEAM 
SCORED z RUNS IN THE iTH INNING—(BASED ON 1000 GAMES 
FROM NL, AL, 1959) 








x =Number of Runs Scored 
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The main arguments in this paper are based on the data from the 782 games 
played in 1958. However, in order to obtain additional evidence regarding re- 
sults that appeared to deserve further examination, data were also collected 
from a sample of 1000 games selected at random from the 1959 seasons of the 
National and American Leagues. No abandoned games were included in this 
sample. Table 2 shows the frequency distributions of inning-by-inning scores, 
and the means and frequencies of scoring three or more runs are plotted on 
Figure 1, using hollow white bars and circles. 

Comparison of the two tables show very close agreement. A chi-square test 
shows no significant difference between the two samples. The low scoring in 
the second inning as compared to the first and third is evident again. 

At this point we must abandon our postulate of homogeneity, and treat the 
inning-by-inning distributions individually. 

In the subsequent analysis, the scoring probabilities are deduced directly 
from the observed data of Table 1, without any attempt to replace the observa- 
tions by a mathematical law. However, examination of the shape of the dis- 
tributions is interesting for its own sake, and is described in Appendix A. 
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Since baseball games are terminated as soon as the decision is certain, the 
last half of the ninth (tenth, eleventh, - - - ) inning is incomplete for games won 
by the home team in that particular half-inning. The results in Table 2 show 
separate rows of complete and incomplete ninth and extra half-innings. The 
frequency distributions are very different for complete and incomplete half- 
innings. A winning (and therefore incomplete) last half must have at least one 
run. A large proportion of winning last halves of the ninth show two or three 
runs, and the mean is 1.88. One run is usually enough to win in an extra inning, 
and the mean score for winning last halves is 1.40. Completed halves include 
all the visiting team’s record, but exclude all winning home halves, so that the 
mean scores are low: 0.40 for completed ninth half-innings, and lower still, 
0.32, for completed extra halves. 

The basic probability function required for the calculation is f;(x), the a 
priori probability that a team will score z runs in its half of the 7th inning, pro- 
vided that the half-inning is played, and assuming it to be played to completion. 
If half-innings were always played to completion, the distributions observed 
in Tables 1 and 2 would provide direct estimates of f;(z). However, for i>9 
the distributions for complete half-innings recorded in the tables cannot be 
used directly, since they have excluded the decisive winning home halves, while 
the distributions for incomplete halves are ineligible just because they are in- 
complete. 

The problem of estimating f;(x) from the observed frequencies for i>9 is 
discussed in Appendix B. There it is concluded that f9(x) is found to be very 
nearly the same as fis(x), the mean of the first eight innings, so that the ninth 
inning does not present any unusual scoring pattern. There are too few extra 
innings to allow calculation of the tenth, eleventh, - - - innings separately, so 
a grouped distribution fe(x) applying to all extra innings is sought. fz(z) as 
calculated from Table 1 (the 782 games of 1958) does not agree with fis(x), 
showing a substantial deficit in ones and an excess of twos, threes, and fours. 
But the 1000 games of 1959 (Table 2) produce distributions consistent with the 
hypothesis that fz(x) ~fis(z). 


3. INHOMOGENEITY INHERENT IN THE SAMPLE 


The 1782 games from which all of the data in this paper have been obtained 
include two seasons, three leagues, twenty-four teams, day and night games, 
and single games and doubleheaders. If the sample were subdivided according 
to these, or other categories, it is possible that significant differences in the 
populations might be discernible. However, as was pointed out in an earlier 
study of batting averages [4, p. 200], subdivision of the sample soon reduces the 
numbers to the point where the sampling error exceeds the magnitude of any 
small effects being examined, while extension of the period of time covered will 
introduce new sources of inhomogeneity such as changes in the personnel of the 
teams. Therefore, many small effects will be measurable only if they appear con- 
sistently in large samples necessarily compounded from many categories of 
games. These are the most interesting effects for the general spectator, al- 
though a manager would prefer to know their application to the games of his 
own team exclusively. 
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In any case, it is the contention of the author that professional baseball 
games offer a very homogeneous sample. Teams differ from one another so little 
that it is very unusual for a team to win less than one third or more than two- 
thirds of their games in a season. Of the sixteen major league teams, the lowest 
mean total score for the 1958 season was 3.40, the highest 4.89, the mean 4.39, 
and the standard deviation between mean scores was 0.375. Therefore the dif- 
ference between teams, if they represented sixteen separate populations with 
means as measured over the season, would show a standard deviation of ap- 
proximately one-third of 0.375, or 0.12, for each of the nine innings, which is a 
small magnitude as compared to the standard deviation of approximately 1.0 
for the game-to-game variation in the pooled sample. 

One hundred Chicago and Washington games from the 1959 American 
League were extracted from the 1000-game sample, and their inning-by-inning 
distributions analyzed separately (as in Table 1). (These teams finished first 
and last, respectively.) The mean scores and standard deviations for the first 
eight innings combined were n»=0.40, o=0.94 for Chicago, and »=0.47, 
o=0.85 for Washington. They do show slightly smaller standard deviations 
than the pooled distribution, but ¢ is still about 2u. 

A small inhomogeneity probably does exist between the visiting and home 
halves of any inning. The home team won 55.0% of the major league games of 
1958, and 54.7% of the 1000 games of 1959. In the latter sample the total score 
of the home teams showed »=4.50, ¢=3.03, while for visitors the result was 
u=4.38, o=3.13. The differences between these are negligible as compared to 
the inning-to-inning differences of the pooled distribution. 

While it might be interesting to extract separate sources of variances due to 
various inhomogeneities, it seems much more profitable in this exploratory 
study to use pooled data and seek effects which appear in spite of whatever 
inhomogeneity may be present. 


4. INDEPENDENCE BETWEEN HALF-INNINGS 


The postulate of independence could be tested by conventional methods, 
such as computation of coefficients of correlation and comparison of conditional 
distributions. This is in fact done in Appendix C, for certain pairs of half-in- 
nings, and shows no significant correlation. The method is applied only to pairs 
of half-innings, whereas the interesting questions pertain to tendencies notice- 
able over the whole game. 

A different approach to the testing of independence is followed in the main 
body of this paper, which seems more closely related to the question at the fore- 
front of the minds of the participants and spectators during the course of a base- 
ball game, and which is likely to be more sensitive than the methods of correla- 
tion coefficients or conditional distributions. This is to examine the probability 
that the team which is behind at a particular stage of the game will be able to 
overcome the lead and win. It is possible to deduce this by probability theory, 
if independence is assumed and the f;(x) derived from the data of Table 1 used, 
and the predicted result can be compared with that actually observed in prac- 
tice. Also, in addition to the probability of the lead being overcome, several 
other distributions of interest, such as the winning margin, the total score of 





PROGRESS OF A BASEBALL SCORE 709 


each team, and the length of the game, can be computed and compared with 
observed results. 


5. LENGTH OF GAME 


It is shown in Appendix B that if we know the distribution of runs scored by 
a team in each half-inning (whether homogeneous or not), and if the scores ob- 
tained in successive half-innings are independent, then it is possible to caleu- 
late the probability that the score will be tied at the end of nine innings, and the 
probability that the two teams make equal scores during any single extra 
inning. For the data collected here, these probabilities are 0.106 and 0.566, 
which implies that, of all the games which had a ninth inning, a fraction 0.106 
would require a tenth, and that of all the games which required a tenth (elev- 


TABLE 3. THE STAGES AT WHICH VARIOUS GAMES CONCLUDED 








Sample i=9 10 11 12 13 14 15 16 





1958 | Prob. (=i) |.894 .046 .026 .015 .008 





1958 | Nz (obs) 777 67 32 22 13 
Nz; (exp) 82 38 22 12 





Nz; (obs) [1000 95 44 20 15 
Nex (exp) 106 «540 (Ci H_O—=“—ié‘ziD 





Game finished] At end of v9) During H9 | At end of H9! During E| Total 





1000 














N (obs) 424 61 420 95 | 1000 


N (exp) 400 47 447 106 





enth, twelfth, - - - ) inning a fraction 0.566 would require an eleventh (twelfth, 
thirteenth, - - - ) inning. 

The row labelled “Prob(=7)” of Table 3 shows the calculated a priori prob- 
ability that a game will require exactly i innings. The third row shows the 
number of games (out of the 782 in 1958) which actually did require an ith in- 
ning. The fourth row shows the number that would be expected, based not on 
the a priori probability but on the formula 


No (exp) = 0.106N, (obs) 
N; (exp) = 0.566N;_; (obs) fori > 11 


which is based on the number of games actually requiring the inning previous 
to the ith. 

Similar results are shown for the 1000 games of 1959. 

For both samples, the number of games actually requiring a tenth and elev- 
enth inning is rather less than predicted, but the difference is not significant to 
to the 10% level of chi-square. 

Another examination of the lengths of games can be made by observing the 
number that end without requiring the home half of the ninth inning (desig- 
nated as H9), the number that are concluded during H9, the number ending 





710 AMERICAN STATISTICAL ASSOCIATION JOURNAL, SEPTEMBER 1961 


with the conclusion of H9, and the number requiring one or more extra innings. 
These are listed in Table 3 for the 1000 games of 1959. Also listed are the 
number that would be predicted by the independent model, using the theory of 
Appendix B. The number of games decided in the last half of the ninth exceed 
the predicted number (significantly at the level «~0.04). 

Of all the 531 extra innings observed in both samples, a fraction 0.538 were 
tied (as compared to the expected fraction 0.566: the difference is not signifi- 
cant). 

It seems fair to conclude that the length of games actually observed is con- 
sistent with the prediction made on the hypothesis of independence between 
innings, although there is a slight tendency for games to be concluded in the 
last of the ninth, or in the tenth, more often than predicted. 


6. TOTAL SCORE 


The distribution of the total number of runs scored by one team in an entire 
game is shown in Table 4, in the rows labelled “Observed.” The frequencies are 
compared with those predicted on the model of independence, by calculations 
described in Appendix B, and employing estimates of f;(x) obtained from the 
data of Table 1. The mean score is 4.25, and the standard deviation is 3.11. 

Figure 2 shows the same results in graphical form, expressed as normalized 
probabilities. The continuous line shows F(x), the probability of one team 
scoring x runs in a complete game as predicted by the theory. The midpoints of 
the vertical black bars represent the frequencies observed in the 782 games of 
1958. The length of each bar is 2\/F(x)|1—F(x)]/N, where N is the number of 
observations, 1564. About two-thirds of the bars would be expected to overlap 
the predicted values on the continuous line. 

The vertical white bars show the results observed for the 1000 games of 
1959. 





TABLE 4. DISTRIBUTION OF TOTAL SCORE BY ONE TEAM 





| Total Runs: 0 1 2 3 4 5 6 
Sample |—-—— - 


Frequency 


Observed 
1958 | Expected 
(O-E) 


100 193 225 240 193 138 136 115 70 
95 167 218 233 219 184 145 108 77 
26 —26 —46* 


' 
es 
| 


Observed | 134 196 249 276 286 241 140 100 
1959 Expected | 122 214 278 298 280 236 86 138 98 
(O-E) —29 








Total Runs: 





Frequency 
Observed 
Expected 
(O-E) 


18 7 
9 5 2 


—— —— 








12 6 2 


-— - — 


Expected 
(O-E) 








| 
Observed | 14 13 10 
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The results for the larger scores are also shown with an expanded vertical 
scale on the right of the diagram. 

The rows labelled (O-E) in Table 4 show the difference (observed-expected) 
only when chi-square is larger than would be expected on 10% of occasions. 
When the value has an asterisk this signifies that chi-square exceeds the value 
for «=0.001. There are two anomalous points—a very small number of 5-run 
games in 1958 and a very large number of 11-run games in 1959—and a gen- 








PROBABILITY 














¥ T 


io (12 
RUNS 


Fia. 2. Distribution of total scores by one team. 
Black: 782 games of 1958 
White: 1000 games of 1959 


eral tendency to observe more large scores (over 10 runs) than predicted. With 
these exceptions, the agreement is good. 

The excess number of large scores suggests the possibility that a team against 
which a large score is being made might regard the game as hopelessly lost, and 
decide to leave the serving but ineffective pitcher to absorb additional runs, 
instead of tiring out more relief pitchers in a continuing effort to halt their 
opponents. However, this would imply that games in which a large total score 
was obtained by one team would show a disproportionate share of their runs 
coming in the later innings, but an examination of those games in which the 
winner scored more than 10 runs showed no such effect. 

It may be that there are a small number of occasions on which the batters 
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are in abnormally good form, and for which the statistics do not belong to the 
general population. 


7. ESTABLISHMENT OF A LEAD 


As well as the number of runs scored in each half-inning, the difference be- 
tween the scores of the two teams totalled up to the end of each full inning was 
recorded. Table 5 shows the number of games in which a lead of / runs had been 
established by the end of the ith inning, and at the end of the game, for the 782 
games of 1958. 


TABLE 5. ESTABLISHMENT OF A LEAD 











Inning: 1 y 3 4 5 7 


Total: 782 y 782 782 g 779 








Established 
Expected 
(O-E) 





Established 
Expected 
(O-E) 





Established 
Expected 
(O-E) 








Established 
Expected 
(O-E) 





Established 
Expected 
(O-E) 





Established 
Expected 
(O-E) 





Established 
Expected 
(O-E) 





«<0.1 

















In Appendix B, a formula is derived for the probability that one team will 
have established a lead of | at the end of 7 innings, on the assumption that the 
probability of scoring z runs in the ith inning is f;(x), and that there is no cor- 
relation between innings or with the score being made by the opposing team. 
The lines in Table 5 labelled “Expected” show the number predicted by this 
formula. 

The differences between the numbers observed and expected are entered in 
the lines of Table 5 labelled (O-E) only when the difference has a chi-square cor- 
responding to «<0.1. When e<0.02 an asterisk is attached to the number. 

In the bottom line, the value of ¢ appropriate to chi-square for all observa- 
tions in the inning is shown only if «<0.1. 

Figures 3A and 3B show the results in graphical form, with the observed fre- 
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quencies of leads of 0, 2, 4, 6, and 8 runs being shown by the circles on Figure 3A 
and leads of odd value on Figure 3B. The observed frequencies for 1=0 are 
shown as solid black circles, to distinguish them from the other hollow circles. 
Circles are shown only where the number has been observed in at least 10 
games. 

The continuous lines on Figures 3A and 3B represent the theoretical fre- 
quencies based on the postulate of independence between innings. The length 
of the vertical bars on the circles representing observed frequencies indicate 
twice the standard deviation, 2\/pqg/N, where Np is the theoretical frequency. 

All curves except 1/=0 start from 0, since the lead is 0 when the game begins. 
The probability of the score being tied (i.e. 1=0) decreases as the game pro- 
ceeds. dropping to 0 for the final result, since extra innings must be played until 
140. The probability of a small lead (/=1 or 2) increases for a few innings, but 
then decreases in the later innings because of the steady rise in the probability 
of larger leads (/>2). 

The practice of stopping the game as soon as the decision is certain causes 
sudden deviation in these probabilities for the scores at the end of the ninth 
inning and at the end of the game. If the home team leads by />1 at the end of 
the eighth, and the visitors reduce the lead but do not erase it in their half of 
the ninth, then the home half of the ninth is not played and there is no oppor- 
tunity to increase the lead once more. The home half of the ninth, and of extra 
innings, always commences with the visitors ahead or the score tied, and ceases 
if and when the home team achieves a lead of 1 (except in the circumstances 
that the play that scores the winning run also scores additional runs, as might 
occur if a tie were broken by a home run with men on base). Thus, almost all 
games won in the lower half of the ninth or extra innings will show a final mar- 
gin of one. The formulae of Appendix B predict that 82% of games decided in 
extra innings will be by a margin of one run. 

The consequence of these two factors (i.e. not starting the home half of the 
ninth if the home team is ahead, and not finishing the home half. of the ninth or 
extra inning if the home team gets ahead) is to raise the probability for a lead 
of 1=1 at the end of the ninth, and especially at the end of the game, with a 
corresponding adjustment to the other probabilities. 

The final winning margin at the end of the game, indicated in the last columns 
of Table 5, and above the label “F” on Figures 3A and 3B, is given for the 1000 
games of 1959 as well and is also displayed separately on Figure 4. The continu- 
ous line shows the margin calculated on the hypothesis of independence, ex- 
pressed as a normalized probability. The vertical black bars show the results ob- 
served for the 782 games of 1958, and the vertical white bars for the 1000 games 
of 1959. As before, the centre of the bar marks the observed frequency, and the 
total length of the bar is 2\/pq/N where the predicted frequency is Np. 

It is evident from inspection of Figure 4 that the fit is extremely satisfactory, 
except for the excess of margins of ten or more. If we combine the results for 
1958 and 1959 the excess for 1>10 is significant at the level of «=.005. 

The slight excess of large final margins may be caused by the excess number 
of games with high scores already noted in Table 4 and Figure 2, since games 
with one team making a large total score will tend to show larger than average 
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winning margins. (The mean winning margin for the games of 1959 in which 
the winner scored more than 10 runs was 7.6 runs). 
The observed mean winning margin for all games is 3.3 runs. 


8. OVERCOMING A LEAD AND WINNING 


When the frequency of establishment of a lead was recorded, it was also 
noted whether the leading team eventually won or lost the game. 

Table 6 gives the number of occasions on which the lead of / (established at 
the end of the ith inning with frequency given in Table 5) was overcome, and 
the game won by the team which had been behind. Appendix B shows how this 
frequency can be predicted on the postulate of independence. Table 7 shows 


TABLE 6. OVERCOMING OF A LEAD AND WINNING 








Inning: 1 2 3 4 6 7 








Total: 782 782 782 779 





Overcome: 59 79 87 7 53 
Expected: 75 97 88 70 55 
(O-E) —16 —18 a= —_ 
Overcome: 31 30 82 26 
Expected: 30 38 84 26 
(O-E) — — ead mx 








Overcome: ¢ 22 11 
Expected: 24 15 
(O-E) oa a 
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the computed frequencies expressed as probabilities, while Table 6 shows them 
as number of games (out of 782) in the lines labelled “Expected.” The lines 
labelled (O-E) show the difference between observed and expected only when 
chi-square would attain such a magnitude 10% or less of the time. Chi-square 
for the combined totals of each inning is less than the value for e=0.1 for every 
inning. 
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TABLE 7. CALCULATED PROBABILITY THAT TEAM WITH LEAD 1 AT 
END OF iTH INNING WILL WIN THE GAME 








Inning t 1 2 3 4 5 6 





Lead I 
.939 .944 j .970 .979 .988 

.905 .913 .932 .947 -961 .974 .987 .995 
.857 .867 .889 .908 .930 .949 . 980 .987 
.793 .803 .827 .849 .877 .907 .940 .974 
.700 .720 .741 . 764 .793 .831 .878 .936 
.610 .617 .630 .647 .668 .705 .776 .846 
. 500 .500 .500 .500 .500 . 500 .500 .500 
.389 .383 .368 .353 .331 .295 .244 .153 
.290 . 289 .257 . 236 . 207 .168 .122 .063 
.207 .196 .171 .150 .122 .093 .060 .025 
.142 .133 .109 .091 .070 .050 .029 O11 
.095 .087 .067 .053 .038 .025 -013 .004 
—6 .061 .054 .039 .029 .020 .012 .005 .000 








Figures 5A and 5B show the observed and expected frequencies of overcom- 
ing 4 lead of I (i.e. of winning after having established a lead of —/) at the end 
of the 7th inning. Figure 5A shows observed frequencies for margins of 1= —1 
and —3 indicated by the circles with standard deviation bars, and calculated 
frequencies for /= —1, —3, —5 and —7, indicated by the continuous lines. Fig- 
ure 5B shows observed values of l= —2, and —4, and calculated values for 
l= —2, —4, —6 and —8. In all cases, as might be expected, the probability of 
overcoming a given deficit decreases as the game proceeds. The theoretical 
values at 7=0 have no significance in an ordinary game, since it begins with a 
score of 0-0. They would apply only if there were a handicap, or in the last 
game of a series played with the total run score to count, a practice not adopted 
in baseball. The curve for 1=0 would be a straight line at P=0.5, since with 
the score tied, either team is equally likely to win. 

To illustrate, a team that is two runs behind at the end of the fourth inning 
has a theoretical probability of 0.236 of winning the game, as given by Table 7. 
Table 5 shows that a lead of 2 was established in 138 games (out of 782). Table 
6 shows that the team that was 2 runs behind eventually won 42 of them. The 
expected number is 0.236 138=33. Figure 5B shows the observed point at 
42/138 =0.303, with a standard error of ¢ = »/pg/N = +/(.236) (.764)/128 = .036. 
Chi-square is (42—33)*/33=2.5, which for 1 degree of freedom has e>0.1. 
Therefore, the lead was overcome rather more often than predicted by theory, 
but the difference is not statistically significant at the level of e=0.1. 

It may be concluded from these data that the predicted frequency of over- 
coming a lead is confirmed quite well by observation, so that the numbers in 
Table 7 (based on the postulate of independence) provide a measure of the 
probability that a team can overcome a lead and win the game. 








9. CONCLUSIONS 


1. The distribution of runs scored per half-inning is not the same for all in- 
nings. The expected number of runs scored is significantly less in the second in- 
ning than in the first or third. The mean expectation per half-inning is 0.48 runs. 
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2. The total score achieved by one team is consistent with a model based on 
random sampling from separate and independent half-inning samples, except 
that the number of cases of large scores of more than ten runs is somewhat in 
excess of the prediction based on the theory of independence. The mean total 
score for one team is 4.2 runs. 

3. The establishment of a lead appears to follow the postulate of independ- 
ence quite well except for a slight tendency toward very large final winning 
margins (of 10 runs or more). 

4. The frequency with which a lead is overcome and the game won agrees 
very well with the postulate of independence. It does not appear that there is 
any significant tendency for the trailing team to overcome the lead either more 
or less frequently than would be predicted by the model of random drawings 
from successive half-innings. From Table 7 it is possible to estimate the prob- 
ability that the game will eventually be won by the team which is ahead. 

5. The number and length of extra-inning games is not inconsistent with the 
postulate of independence, although the number of games ended in the ninth 
and tenth innings is somewhat greater than would be expected. 

6. It seems logical to attribute the inhomogeneity of the first three innings 
to the structure of the batting order. 


10. REMARKS 


Perhaps these findings will be considered disappointing. There is no observ- 
able tendency for the underdog to reverse the position. Although the game is 


never over until the last man is out in the ninth inning, it is lost at the 23% 
level of probability if the team is more than 2 runs behind when they start the 
ninth inning. There is nothing unusual about the seventh inning. The peculiar 
innings from the point of view of scoring are the first three. The number of runs 
scored in the ninth inning is normal, but the 6% of games won during the last 
half of the ninth exceed the expected proportion. 

One way in which these results could be useful to a manager is when he is 
considering changing personnel during the game. If he wishes to rest a winning 
pitcher when the game appears to be won, or to risk the use of a good but in- 
jured player in order to overcome a deficit, he will want an estimate of the prob- 
ability that the present status will be reversed. He would use the information 
in conjunction with other statistical data [4] and his evaluation of the immedi- 
ate form and special skills of his players. 

Another application is to test certain scoring records believed to be unusual. 
For instance, the Chicago White Sox won 35 games by one run during the 1959 
American League season, which was generally considered to be remarkable. 
From Figure 4, the expected number would be 4 X0.327 X 15425, (assuming 
that .327 of their games would be decided by one run, and that they would win 
half of these). Taking o=./Npq=4.6 the difference (O-E) =10=2.2¢ would 
be expected to occur with probability 0.028. In other words their accomplish- 
ment had a probability of about 3%, and would be expected to be repeated or 
exceeded by one of sixteen teams about once in every two seasons. 

In a full season of two eight-team major leagues we would expect the number 
of games lasting eighteen innings or more to be 1548 X0.106 X0.566' ~ 1. 
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In the World Series of 1960, the three games won by New York were by 
scores of 16-3, 10-0, and 12-0. Pittsburgh won the last game by 10-9. 

From Figure 4, the observed probability of the final margin exceeding nine 
runs in 0,033. Therefore the probability of three or more games out of seven 
having margins of ten or more is 

7 


7 
2 ( ) 033 .9677-" = 0.0014. 
r=3 r 

From Table 4, the observed probability of one team scoring more than nine 
runs in a game is 0.0655. Therefore the probability that there will be four or 
more scores of 10 or more in seven games is approximately 

14 14 
> ( ) 0656" .9345'-" = 0.015. 
r==4 r 

Therefore we can conclude that the three one-sided games did constitute a 
very unusual combination, to be expected only once or twice in 1000 seven- 
game series. The four large scores are less extraordinary, and would be expected 
to occur on about 143% of seven-game series. 

Thus even if this investigation has committed the sin of exploding some 
cherished beliefs, it does permit estimates to be made of the probability of 
occurrence of certain rare accomplishments. 

APPENDIX A 
DISTRIBUTION OF RUNS WITHIN AN INNING 


The shape of the frequency distributions of Table 1 could form the subject 
of an interesting analysis. Since the standard deviations are about twice as 
great as the means, the distributions do not follow a Poisson law. 

If the negative binomial distribution [3, p. 155; 5, p. 179] 


yt+e-h 
o(y; a, p) = ( der — 
is fitted to the bottom line of Table 1 by choosing p=y/o=0.475 and 
a=p?/(o—u) =0.43, so that it has the same mean and standard deviation as the 
experimental distribution, the values listed in the line of Table 8 labelled 
(x; a, p) are obtained. Use of such a law might allow mathematical manipula- 
tion of the distributions, but it is difficult to suggest any mathematical model 
of baseball which would indicate a law of this precise form. 


TABLE 8 








Number of Runs 





Function 


0 1 2 3 a 





Observed f(z) | -- —— .737 ~=.144 
o(z; a, p) ‘ 475 | .726 - 164 
o(xz; a, p) 3 .685 | .321 .304 


.35¢1 + .65¢2 | 3 -685 | .749 = =.133 
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A model representing an extension of the negative binomial distribution is 
suggested by the fact that, if a set of trials for which the individual probability 
of success is p is repeated until a successes have been obtained, then the proba- 
bility that y failures will have occurred before the ath success is ¢(y; a, p). If a 
trial consists of the fielding team dealing with a batter, and putting him out is 
considered a success, then the probability that y men will bat but not be put 
out before the inning is over will be ¢(y; 3, p) where p is the probability that a 
player who appears at bat is put out during the inning. A player appearing at 
bat but not put out must either score or be left on base. The number left on 
base, L, can be 0, 1, 2 or 3. The number of runs scored is x=y—L. 

To be accurate, the mathematical model would need to account for all pos- 
sible plays and for their relative frequencies of occurrence. Models somewhat 
simpler than this have been tested [1, 2, 7] with success. 

A crude approximation would be to assume that in a fraction A of the innings 
in which runs were scored, one man was left on base, while two were left on in 
the remaining (1—A) scoring innings. The distribution of runs would then be 
given by: 


f(x) = Adil(z) + (1 — A)g2(z) 
where 
$:(0) = $(0; 3, p) + $(1; 3, p) 
oi(z) = o(x + 1;3, p) forz = 1,2,3,---: 
$2(0) = $(0;3, p) + (1; 3, p) + (2; 3, p) 
$:(z) = o(x + 2;3, p) forz = 1,2,3,---: 


TABLE 9. DATA FOR AMERICAN AND NATIONAL LEAGUES (1958) 





Event A.L. N.L. 
(1) Official Times at Bat 41,684 42,143 
(2) Bases on Balls 4,062 4,065 
(3) Hit by Pitcher 252 247 
(4) Sacrifice Hits 531 515 
(5) Sacrifice Flies 322 322 





(6) (1) +(2)+(8) +(4) + (5) 46 ,851 47,292 
(7) Safe Hits 10,595 11,026 
(8) Errors 1,002 1,083 
(9) Double Plays 1,313 1,287 
(10) (2) +(3) +(7) +(8) —(9) 14,598 15,134 29,732 











To estimate the constant p, the probability that a man appearing at bat will 
be put out during the inning, we could note that in many nine-inning games 
some players make four and some five appearances at bat, so that out of 
9X44 ~40 batters, 27 are put out, and p~0.68. Or, to be more methodical, we 
could use data from the major league season of 1958 [6] listed in Table 9. These 
show 94,143 batters appearing, of which 29,732 were not put out, so that 
p=1—.315=.685. 
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The last line of Table 8 shows the values of f(x) calculated for \=0.35, for 
which reasonable agreement is found with the distribution actually observed. 


APPENDIX B 
THE FIRST EIGHT INNINGS 
Let f;(x) be the probability that team A will score exactly z runs in the ith 
inning, where 1<7<8. 
We assume throughout that both teams have the same f,(z). 
Let F;(z) be the probability that team A will score exactly z runs between 
the ith and jth innings (inclusive), where i<j<8 


F(x) = fi(z). 


Tables of F,;(z) can be accumulated by successive summations such as 


Fist) = 5 Fife - 1) 


r=0 


F F.sOile - 9) 


r=0 


r=z 


LfMFude-r i<j ss. 
r=0 
The probability that A will increase their lead by exactly z during the ith 
inning is 


Gi(z) = Vf(@Mflet+r) forz=0,1,2,---. 
r=0 
G;(—2x) =G,(z) =probability that A’s lead will decrease by z during the ith 
inning. A negative lead is, of course, the same as a deficit. 
The probability that A will increase their lead by exactly x between the ith 
and jth innings (inclusive) is 


Gil) = Fil) Fole +0). 


r=( 


INNINGS WHICH MAY BE INCOMPLETE 


For i=9, the home half-inning (H9) will not be started if the home team is 
ahead at the end of the visitor’s half (V9). For i>9, the home half, if started at 
all, will be terminated if and when the home team achieves a lead. 

Define f;(x) to be the probability that team A would score z runs in their half 
of the ith inning if it were played through to completion (even after the results 
were assured). Define Sy;(xz) and Sy;(x) as the probabilities that the visitor’s 
and home halves of the 7th inning, if started at all, will produce z runs as base- 
ball is actually played. 
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Svi(x) = fi(x) 


since the visitor’s half is always completed. Thus we could use the measured 
distribution of visitor’s scores to estimate f,;(x). However, the assumption that 
the visitor’s and home team have the same distribution is open to question, 
and it seems preferable to use the data from the home halves as well. 


Sui(x) = fi(x) for i < 9, but not for i > 9. 


Define g(l) as the probability that the visitors have a lead of / at the end of 
V9. 


x 


gl) = pm Gi3(l — r)fo(r). 


r=(0 


The probability that H9 is required is then 
g(t > 0) = 2g) =a (say). 
l=0 


Out of a large sample of N games we would observe N V9’s and Na H9’s. 

Of the Na H9’s, NG.,(0) would leave the score tied and require extra innings, 
and N[1—G,(0)|/2 would leave the game won by the visitors. These 
N [1+G,»(0) |/2 H9’s would all be complete. In them the home team would 
score x runs only when z<l. 

If we recorded only completed ninth half-innings (as in Table 2) and com- 
puted the frequency Syc(x) of scoring x runs 


N * 
— [3 + Gis(0)|Sse(2) = Nfa(z) E + Da]. 


loz 


The remaining Na—N[1+G,,(0) |/2 H9’s would result in victory for the 
home team, and be terminated with less than three out. The winning margin 
will almost always be one run, but can be greater on occasion when the final 
play that scores the decisive run produces additional runs. For example, if the 
home team were one behind in H9, but produced a home run with the bases 
full, the final margin would be recorded as three runs. 

If we neglect the infrequent cases when the winning play produces a margin 
in excess of one, then all scores of z runs in incomplete H9’s will be associated 
with leads of /=x—1 for the visitors at the end of V9. But the probability of H 
scoring x under these circumstances is now f,( ><) rather than f3(z). We could 
imagine that the inning is played to completion, but a score of (1+1) only re- 
corded. 

If we recorded only incomplete halves of the ninth inning (as in Table 2) and 
computed Syor(x) the result would be 


Sasi (0) =0 


g(x — 1) = 
Sas = 9 forz ' 
agr(x) a—1/2{1+G.0(0)] 2 fulr) rz>0 








PROGRESS OF A BASEBALL SCORE 3 


If we combine all halves of the ninth (as in Table 1), and compute S,(x), the 
result is 


S9(0) = fo(0) 


S,(z) = 


{fot E + > x | + g(x — 1) Ls} forz > 0. 
loz r=z 


1 
l+a 

For extra innings, paucity of data will make it advisable to group the results 
and seek fz(x), the probability that a team starting its half of any extra inning 
would score vt runs if it played the inning to completion. 

teasoning similar to that applied to the ninth inning allows us to deduce 
Szec(x), the distribution of runs for completed extra half-innings, Szr(x) for in- 
complete extra half-innings, and Sz(x) for all extra innings. 


Sx(z) E + Da | 


rz 


€ 


S SS 
o(2) 3 + Gz(0) 
Sz(0) =0 


9 


Sa, res fort > 0 
ion" ) Lids (r) or x 


S2(0) = fr(0) 


Ser (x) 


Spe(x) = 1/2 Vfe(e + [fe(x — 1) + fe(2)] eG), forz > 0. 
NUMERICAL RESULTS 
Starting with S,(x) as tabulated in Table 1 (T1), we can compute f,(z). The 
resulting distribution is shown in the first line of Table 10, and beneath it is 
fis(x), the mean of the distribution f,(z) for 1<i<8. They are obviously very 


TABLE 10 





Function | Source : 2 3 
So(xz) Cale. fm. Sge(z) of T 1 » 
Fin(x) Obs. in T 1 .736 14: .069 .028 
Fis(z) Obs. in T 2 .72 . ; .029 
Ssc(z) Cale. fm. fis(x) of T 1 7 42 .062 .023 
Sec(z) Obs. in T 2 77 . 129 .060 .027 











Saei(z) Cale. fm. fis(x) of T 1 ; : .26 .10 

Saer(z) Obs. in 7 2 , , .26 .23 

Sx(2) Cale. fm. fis(x) of T 1 74 047 016 
Se(2) Obs. in T 1 . 753 13% .076 .027 
Sp(2) Obs. in T 2 7 .052 .018 
Sec(z) Cale. fm fis(x) of T'1 ‘ -039 -017 
Sgc(2) Obs. in T 2 ; .033 .012 

















Sz1(z) Cale. fm fis(z) of T 1 , .08 .02 
Sxe1(z) Obs. in T 2 17 -06 
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close, as may be confirmed by chi-square, so that the ninth inning is not an 
unusual one. 

In subsequent calculations fis(z) from T1 was used as the best estimate of 
fo(x). fis(x) calculated from T2 is not significantly different. 

The observed results of Table 2, as separated between complete and incom- 
plete ninth innings, are compared with the predicted functions. The agreement 
is close for the 1576 complete half-innings, but less satisfactory for the 61 in- 
complete half-innings. 

For extra innings, the distributions are calculated on the assumption that 
fe(x) =fis(x), and compared with those observed. There is a significant differ- 
ence from the 302 observations in Table 1, for which less ones and more twos 
and threes were found than predicted. If the calculation is reversed, and fz(x) 
derived from the observed Sz(x), the result shows fze(1)=.061, fe(2)=.119, 
fe(3) =.043, and fz(4)=.019, which is radically different from fis(x) or any 
other f,(x). However, Sz(x) and Sgc(zx) as calculated from fis(x) agree quite 
well with the distributions observed in the 384 extra innings and 331 completed 
extra innings of Table 2. The agreement is poor for the 53 incomplete extra 
innings. 

All cases of disagreement involve incomplete innings and show a deficit of 
one-run scores, suggesting that the neglect of scoring plays resulting in margins 
of more than one run may be partially to blame. 

These results for extra innings are contradictory. However, in view of the 
excellent agreement with the 331 completed extra innings of Table 2, it would 
seem unjustifiable to conclude that extra innings offer any substantially differ- 
ent probability of scoring from the first nine innings. 


LENGTH OF GAME 
The probability that a tenth inning will be necessary is the same as the prob- 
ability that A’s lead will be 0 after nine innings, i.e. G;,9(0). 
The probability that an eleventh inning will be necessary is G;,9(0)G10(0). 
If we group all extra innings together, so that 
fiz) = fe(z) 


and fori >9 
G(x) = Gz(z) 


then the probability that an ith inning will be required is 
G:,9(0)G2(0)*"° fori = 10, 11, 12,---. 
To state the probability in a manner more suitable for comparison with ob- 
served results, if N» games are observed to require a ninth inning, we would 
expect N»G;,9(0) to require a tenth. If N; games are observed to require an ith 


inning (for 1>9), then we would expect N G@z(0) to require an (i+1)th inning. 
The probability that a game will require exactly 7 innings is 


1—G,.(0) fori=9 


and 


G;,9(0)[1 — Ge(0)][Ge)]* for i > 10. 
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DISTRIBUTION OF RUNS SCORED IN ENTIRE GAME 

The distribution of runs accumulated over nine complete innings is F;,9(z). 
However, the distribution of total game scores must include the possibilities 
that the home half of the ninth (19) is not played, or is started but not com- 
pleted, that the game lasts more than nine innings, and that the home half of 
the final extra inning is not completed. 

During the first nine innings, the visiting team has probability F1,9(x) of 
scoring x runs. The probability that the home team will score y consists of four 
terms: 


(a) - F 9(y) 2 F,,9(r) (V win) 


r==y-+-1 


yl 
(b) = Fi,s(y) >> Fi,9(r) (H win without needing H9) 


r=0 


y—1 bec 
(c) = Fis(y — 1) } Fis(r) } fo(s) ( win by one run during anincompleteH9) 


ra=O s=y—r 


(d) = [Fio(y)] (Extra innings are needed). 


If we set Fi,9n(z) =4[Fi,0(x)+(a)+(6)+(c)], this will represent the dis- 
tribution of scores for games decided in ni e innings, for both visiting and home 
team. . 

The distribution of runs scored in each indecisive extra inning is [fz(zx)] . 

In the decisive extra inning, the visitor’s score has distribution fz(x), while 
the home score consists of two terms: 


(©) faz) © fat) WV win) 


rer+1 


(f) fe(t —1) )>fe(z) (Hwininanincomplete half-inning) (zx > 1). 
If we set Fgp(x) =}[fz(z)+(e)+(/)], this will represent the distribution of 
scores for the decisive extra inning. 
The distribution of scores for games decided in 10 innings is then 


Fi ,10p(x) = > [Fio(r)] Fev(x —r) 
r=0 
and for games decided in i>11 innings it is 
Py iv(z) = > Pis-w(r) [fale — 9)’. 
r= 


When these are calculated we can compute the probability that a team will 
score a total of z runs in a whole game, whatever its length, as 


F(z) = Fi,9p(x) + > F iv(z). 


t=10 
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F(x) as computed in this way from the 1958 data is shown in Table 4 and 
Figures 3A and 3B. 


ESTABLISHMENT AND OVERCOMING OF LEAD 


The probability that team A will have established a lead of exactly / by the 
end of the ith inning is G,,,(l), for i<8. The “expected” figures of Table 5 and 
the curves of Figures 3A and 3B are calculated from this function for 7<8. 

The probability that team A will increase its lead by / or more over the ith to 
jth innings, inclusive, is 


Hil) = Gir), fori<j <8. 
r=l 


If team A is exactly / runs behind at the end of the 7th inning, the probability 
that they will win is 


FH izi,9(l +1)+ 1/2Gi41,9()), 


the first term representing a win in nine innings, the second a win in extra in- 
nings, where the assumption is made that when a game enters extra innings each 
team has probability } of winning. The numbers of Tables 6 and 7, and the 
curves of Figures 5A and 5B are computed from this function. 

To calculate the distribution of the winning margin at the end of the game 
it is necessary to allow for the possibilities that H9 will not be needed, and that 
the game will be won in H9 or an HE with the decisive half-inning being 
terminated as soon as a lead of one run is obtained. 

As in the preceding section, the margin at the end of nine innings has a prob- 
ability composed of four terms: 


(a) Gi(l) (V win by J) l>0 
(b) g(—D (H win by / without needing H9) 
(ec) a probability of a—4}[1+G19(0) ] that H win by 1 in the decisive incom- 
plete H9 
(d) a probability of G19(0) that the score is tied. 
For those games which are decided in an extra inning, the probability of a 
finai margin of | is composed of two terms: 
(e) a term proportional to Gze(l) (V win by J) l>0 
(f) a probability of 4 that H wins by /=1. 
Combining these, and normalizing where necessary, we obtain the probabil- 
ity G(J) that the winning margin will be / runs. 
G(0) =0 
ott = hen an necgn 


: G19(0)Ge() 
GD) = Gi() + g(-—) + ——— 
() i9(0) + g(—1) 1 — Gp) 
This is the function used to caleulate the last columns of Table 5, the points 
above “F” on Figures 3A and 3B, and the curve of Figure 4. 
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TABLE 11. CORRELATIONS BETWEEN HALF-INNINGS 
(1000 games of 1959) 








Pair of Half-innings r N 





V1-V2 —0.001 
H1-H2 +0.05 
V1-H1 +0.045 
H1-V2 —0.024 
V7-V8 —0.014 
H7-H8 +0.009 
7-8 —0.002 











APPENDIX C 
CORRELATIONS BETWEEN HALF-INNINGS AND BETWEEN TOTAL SCORES 


A common method of testing for independence between two distributions is 
to compute the correlation coefficient. 

The pair of scores for the visiting team’s first and second innings (V1 and V2) 
for the 1000 games of 1959 show a correlation coefficient of —0.001. The stand- 
ard deviation for small coefficients and large numbers of readings is very 
nearly N~/?=0.032. Table 11 shows several other correlation coefficients, and 
it is evident that none of them are significant. Thus, there is no evidence here of 
any linear correlation between the scores in helf-innings of the same or of the 
opposing team. Absence of linear correlation does not, however, constitute 
proof of independence [3, p. 222]. 

Another method of testing for independence is to draw up conditional dis- 
tributions for particular half-innings in which the scores in other half-innings 
have been below, or above, average. Some of these are shown in Table 12. 
Sy2(z| V1>0) represents the measured distribution of runs in the visitors half 
of the second inning for all cases (out of 1000 games) in which the visitors scored 


TABLE 12. CONDITIONAL DISTRIBUTIONS FOR CERTAIN INNINGS 
(1000 games of 1959) 











z: 0 1 2 


Sy2(a| V1 >0) 31 14 
Exp. 40 16 














Sa2(xz| H1 >0) 20 12 
Exp. 23 10 





Sm(z|V1>0) 15 11 
Exp. 1 14 





Sy2(z| H1>0) 3 
Exp. 7 





Svs(z| V7 >0) +Sus(z| H7 >0) 
Exp. 
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one or more runs in their half of the first inning. “Exp” shows the number that 
would be expected on the null hypothesis that Sy2(z| V1>0) =Sy2(z| V1=0). 
Five such conditional distributions are shown in Table 12, with e giving the 
probability of occurrence of a chi-square as large as observed. No single differ- 
ence between “observed” and “expected” for which “expected” was 10 or more 
showed a chi-square with e<.05. Thus there is no evidence of dependence be- 
tween the half-innings investigated. 

The coefficient of correlation between the total scores of the home and visit- 
ing team for the 1000 games is +0.090+0.032. This is significant at the 1% 
level, and indicates a slight tendency for the two teams to score together. Such 
a tendency would reduce the number of games with large winning margins 
below the predictions of the independent model. Since the number actually ob- 
served is greater than predicted (see Table 5 and Figure 4 for margins of 10 
or more) the small correlation must be overcome by the excess number of 
games with a large total score and an associated large margin. 
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MULTIPLE LINEAR REGRESSION ANALYSIS WITH 
ADJUSTMENT FOR CLASS DIFFERENCES 


M. Davies 
Battersea College of Technology, London 


A method is described for solving by iteration the normal equations 
in multiple linear regression analysis when class differences are present, 
and a proof is given of the convergence of the process. Comparison is 
made with a direct method of solution in which the information matrix 
is adjusted for class differences before it is inverted. 


1. SOLUTION BY DIRECT METHOD 


ET Y be the dependent variate and X;, Xe, - - - , X» the metrical determining 
L variates. The population is supposed partitioned into p mutually exclu- 
sive classes designated by R;, ({=1,---, p), such that R;=1 if the member 
belongs to class i and 0 otherwise. 

The linear regression model to be discussed is of the form 


Y = Bot BiXit-::--+BmXun+ aki +---+ ah, +6 (1) 


where 8», 81, + *, Bm, 01, °° *, @ are unknown parameters and the residual 
term ¢ is a stochastic variable with zero expectation and variance o*. By the 
Markov theorem, best linear unbiassed estimates b; of each 8; and a; of each 
a; are given by minimizing the sample sum of squares 


S(Y — bo — b1X1 — -- - — bnXu — aiR; — - - - — a,R,)* (2) 


where S denotes summation over all the sets of data in the sample, say N sets. 
b; and a; thus satisfy the “normal” equations: 


as 
“4 osm) 


aS 


0a; 


(3) 


More generally, the Markov theorem states that any linear combination of 
the b,’s and a,’s which can be evaluated explicitly in terms of the data from the 
minimizing equations (3) is an estimate (in the best linear unbiassed sense) of 
the corresponding linear combination of the parameters 8;, a;. Such a combina- 
tion is said to be estimable. 

In the present instance, the set of linear equations (3) is singular. This cor- 
responds to the fact that the parameters 8) and a, ---, a, considered indi- 
vidually are not estimable. We can see at once that this must be the case, for if 
the expectation of Y for each set of observations is estimated by 


bo + bi1X1 +--+ + bnXn + a:Ri1 +--+ +,R, 


we get the same set of estimates if bp is increased and each a; decreased by some 
arbitrary fixed amount. Therefore, in order to solve the system (3) for b; and a; 
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it is necessary to add a further linearly independent equation in some or all of 
by and the a;’s. Chew [1] and Kempthorne [3] take 


Pp 
> a = 0. 


i=1 


Rao [4] takes b)>=0, and Suits [5] suggests taking any one specified a;=0. 
However, it is convenient for the present purpose to estimate 8» from the usual 
formula: 


bo = Y - biXi o> 2 ae ee (4) 


where Y=S(¥)/N, X¥;=S(X,)/N, as in the case in which the variation of Y 
due to class differences is ignored. In order that the first equation of the system 
(3) shall reduce to equation (4), the linear equation introduced for the purpose 
of estimability must be 


> n,Q; = 0 (5) 


where n,; is the number of sets of data in the class 7. The resulting normal equa- 
tions then become 


byS(2,") aL ces -t DS (212m) + ayS (x1) + - 2. “po ayS(t1) = S(xy) 


byS (212m) + a + bwS (am) + ayS (rm) + a : + pS (Xm) a S(rny) 
biS(ars) + ++» + bu S(tm) + aims BW) 





byS(x) + Ere + DwS (Lm) + AyNp ae S(y) 
Pp P P 


where S denotes the sample sum over class i and 2;= X;—X;, y= Y—Y. Sys- 
t 


tem (6) may be solved by substituting for a, - - - , ap from the last p equations 
in the first m equations. For this purpose it is convenient to introduce the nota- 
tion S’(x,v;) for the (7, 7)th element of the matrix 


A — GG, 


which we may refer to as the “reduced” information matrix, where A is the 
information matrix 


A= [S(xsx,)], (t, J mI,---, m) 


S(x;) 
o=[: 2 1,+:-,p;j =1,+-+,m). 


n,}/2 


The “reduced” product sums S’(xz,y) of X; with Y are defined by an exactly 
similar process, thus: 
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S(x)S(u) S(x)S() 
———_—— + --- + I : 


ni Np 


Sag) = Sieg) = { 


In this notation the normal equations for the b,;’s assume the familiar form: 
byS’(ay2) + + + + + DS! (tym) = S’ (xy) ) 


, (7) 
byS’ (witm) + + + + + DS’ (tm?) = S’(tmy) } 


the primes denoting “reduced” sums of squares and products. Inverting the 
matrix of numerical coefficients on the left we obtain the variance-covariance 
matrix C= |c;;] of the b,’s. The b,’s and a;’s are then calculated in succession 
from: 


b; = cjyS’ (ayy) + on, + Cin’ (LmY), } o me (8) 


lL 
ay —{S(¥) — nibo — bS(X1) — --- — bnS(Xu)}, G@=1,---,p). @) 


. 
2. SOLUTION BY ITERATION 


The foregoing method of estimating the regression parameters has the ad- 
vantage that the order of the matrix of coefficients to be inverted is the same as 
the number of metrical determining variates. With the advent of electronic 
computers, the original system of linear equations could be solved directly even 
for very large systems, but if desk computer methods are used a reduction of 
the order of the matrix of coefficients is practically essential. 

An interesting problem arises if the existence of class differences is only 
appreciated after the restricted model assuming no class differences has been 
solved. In other words, the problem is to calculate the b; and a; for the unre- 
stricted model from the inverse C* of the information matrix A: 


C=|=A'= [c,;*]. 
A solution of this problem would, of course, also afford an alternative method 


of solving the unrestricted model in the general case. For this purpose, let us 
define a quantity 


y* =y—a,ki—---—a,h, 
so that the first m equations in (6) become 
byS(x1") + ++ + + OwS(eitm) = S(xiy*) 
byS(x1%m) + + + + + DwS (tm?) = S(tmy*). 
On inverting the matrix of coefficients on the left-hand side of (10), the b,’s 
are given by: 
b; = ej*S(aryy*) + + + + + Cim*S(tmy*), (j = 1,---,m) (11) 


and then, as before, bp and a; are given by (4) and (9). Now y* is not known 
a priori. Suppose, therefore, that we replace (11) by the approximation 
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b; -_ eyi*S (xy) a Cim*S(LmY), G = F er ,m) 


which we can calculate. This corresponds to setting all the a;=0 and gives the 
solution for the restricted model. First approximation values to bo and the 
a,’s, say bo and a,;, can then be obtained by applying formulas (4) and (9) 
with b;’s replacing the b,’s, and if the a; are then substituted for the a; in 


S(xyy*) = S(zy) — {aS(X) +--+ + ayS(X)} (12) 


new approximations to S(x,;y*) can be evaluated. These are then substituted in 
(11) to give a better approximation to the b,’s, say b;“, and the whole cycle 
repeated until convergence is obtained. 

In actual practice it is found that, unless there are very large differences from 
class to class, rapid convergence is obtained, and two or three cycles at most 
are sufficient. 


3. PROOF OF CONVERGENCE 


We shall now show that the foregoing solution by iteration is in fact con- 
vergent. Defining the column vectors 


b = {bi,---, ba}, a = {a,,---,a,}, 
h = (S(y),---,S@my)},  & = (Sy), >>> » S(y)} 


the system (6) can be expressed in matrix form as 


a ee 
Fb+ Da=k 


where 
= [S(z,)], @=1,---,pj3j=1,---,m) 
_ [n.6.,], (2, J = l, om. Pp) 
l ifi=j 
bj = ‘ As 
0 if i ¥ J, 
and primes affixed to matrix symbols denote transposed matrices. In the 
(r+1)th approximation for the b,’s this system is replaced by 
Abt) ao F’a“™ =h 
Fb” + Da = a 
which gives 
Ab\+) — F’/D"'Fb® = h — F’D-'k 
whereas the exact solution satisfies 
(A — F’D"'F)b = h — F’D"'k. 
Hence, subtracting: 
A(b\t) — b) = FD“F(6 — b). 
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Let e*+ =b°t+) —b be the error vector in the (r+1)th approximation. Then 


etl) = C*F’D-'Fe™ 
and hence at the nth approximation 
e”) = (C*F’D“'F)"e 
where e is the error vector at the first stage, i.e. in replacing b by b“ =C*h. 


Thus the condition for convergence is that 
(C*G’G)" — 0, 

ie. (AB)"—0 

where B= G’G. Now the matrix A“'B=E, say, can be diagonalized in the form 


E = XAX"! 


asn— ©, 


where A is a diagonal matrix 
Aa 
) a 


X is a non-singular matrix of column vectors x satisfying the eigenvalue equa- 


(E — »\Ix = 0, 
ie. (B — \A)x = 0, 


tion 


I being the unit matrix, and the d,’s are the latent roots of this equation and are 


given by 
|B— rA| =0. 


Since A and B are real, symmetric, positive-definite matrices, the latent 
roots are real and positive. Further, since 
E* = XA*X-1, 
the condition for convergence is that A"—0 as n> ~, 
i.e. all A; < 1, *, mM). 


Now the Rayleigh quotient 
x’ Bx 


x’ Ax 


for any arbitrary non-null vector z, cannot exceed the largest latent root ref. [2] 
which, of course, it equals when x is the corresponding eigenvector. So the con- 


dition for convergence can be put in the form 
x’ Bx 
x’ Ax 
x'(A — B)x > 0 
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for all non-null vectors x. Hence the necessary and sufficient condition for con- 
vergence is that A—B is positive definite. This must be true since A—B is 
the sum of the p individual variance-covariance matrices for the respective 
classes, as follows from the identity 


Pp 


S(X; — Xa)(X; — Xn)} 


P 
= §(X; — X¥)(X; — X¥) -— dS {u(Xa — ¥) (Ra — X)} 
k=1 


connecting the (i, j)th elements of the matrices A—B, A and B, where Xx 
=S (X,)/n;, and the matrix of the elements 
k 


forexample: S(X; — Xa2)(X; — Xp), (i,j =1,---+,m) 


for a set of n, measurements constituting the class k is positive definite by the 
consistency condition for cross-product moments [4, p. 27]. Hence A—B is 
also positive definite. This completes the proof. 


4. TEST OF NULL CLASS-DIFFERENCES 


The iteration method incidentally provides the data for testing whether the 
class differences are real, or simply due to random fluctuations. For, if the hy- 
pothesis of no class differences is true, the model is 


Y = Bo t+ BiX1+ --- + BaXmt+ € 


and under this hypothesis the least squares estimates of the 8,’s are the b;’s. 
We therefore obtain the following analysis of variance: 
Source of variation Sum of squares Degrees of freedom 
Unrestricted regression bS(aiy) + +++ + baS (amy) m+p-—l1 
+ aS(¥) +--> + apS(¥) 
Regression on X;,--+, Xm bOS(tiy) +++ + + dn S(tmy) m 


‘Difference Difference p-1 : 


On the hypothesis that there is no real difference between the classes, and the 
e’s are normally distributed, the ratio 


{ (br — bi )S(aiy) +--+ + (bm — Bm )S(amy) 
+ axS(¥) + ++ + + ayS(¥)}/(p — 1s" 
where 


s? = unbiased estimate of o? 
{S(y2) — byS(ay) — - - - — bmS(tmy) 
— a:$(¥) — --- — a,8(¥)}/(N — m — p) 


is then distributed as the F-ratio on p—1, N —m—p degrees of freedom. 
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5. AMOUNT OF COMPUTATION 


In comparing the direct method of estimating the regression parameters with 
the iteration method, since both methods require the construction of the A and 
F (or G) matrices and one inversion of a matrix of common order, the essential 
difference in the two methods lies in the number of accumulated product opera- 
tions which have to be carried out. In both methods we have to form and par- 
tially sum m? products in calculating the b;’s and mp products in calculating the 
a,’s, but whereas in the direct method these operations are only carried out 
once, in the iteration method they have to be performed » times where + is the 
number of iterations. In addition to this, the direct method requires the reduc- 
tion to the class means which involves the calculation of }pm(m+1) products of 
the X’s in constructing the symmetric matrix G’G, and pm similar operations 
for the cross-products of the X’s with Y, and the iteration method requires the 
formation of mp cross-products in the calculation of the S(x;y*). Thus in the 
direct method we have a total of }pm(m+1)+m?+2mp such operations as 
compared with v(m?+2pm) operations by the iteration method. We can make 
a comparison between the two methods by determining the value of v which 
would make these two expressions equal, and this is given to the nearest integer 
in the following table in the case m= p: 






















No. of iterations for 
m equal amount of effort 
1 1 
5 
10 
15 
25 








m C bo 







In practice the number of iterations required is rarely more than three. 'Thus 
the superiority of the iteration method only emerges for large numbers of vari- 
ates and/or classes. However, as has already been mentioned, if the existence of 
class-differences is only realized after the information matrix has been inverted, 
then the estimation of the effects of class-differences can be made subsequently 
by the iteration method without the necessity for re-inverting the informa- 
tion matrix. 
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A NOTE ON GRIFFIN’S PAPER “GRAPHIC COMPUTATION 
OF TAU AS A COEFFICIENT OF DISARRAY” 


8. M. Suan 
Gujarat University, Ahmedabad, India 


RIFFIN [1] has given a method for the graphic computation of Kendall’s 
G tau. It consists of computing “the number of intersections resulting from 
connecting the two rankings of each member by lines, one ranking having 
been put in natural order,” and computing tau from the number of intersections. 
In the case of ties in both rankings, Griffin suggests the construction of four 
graphs, for each of which the number of intersections is computed. Tau is then 
computed from the sum of the four sets of intersections. 

The purpose of this note is to suggest that only two of the four graphs are 
necessary, say those in which the differences in ranking from that given by 
the natural order are minimized in both sets, and then maximized in both 
sets. If one doubles the total number of intersections found in these two graphs, 
the values of 4s, required for the computation of tau, can be found. That this 
is sufficient can be seen by numbering the columns in the table at the top of 
page 447 and noting that the sum of the entries in columns 1 and 4 equals the 
sum in columns | and 2. 

If one wishes to check the value obtained, the computation may be repeated 
using the other pair of graphs. 
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NOTES ABOUT AUTHORS 


IRMA GLICMAN ADELMAN, 31, was appointed Assistant Professor of Economics 
at Stanford University this year. She had been teaching at Stanford since 1959 following 
earlier teaching assignments at the University of California, Berkeley, where she received 
her Ph.D. in 1955, and at Mills College, Oakland, California. In addition to her article, 
“A stochastic analysis of the size distribution of firms”, which appeared in the Journal, 
December, 1958, Dr. Adelman’s contributions have appeared in the Review of Economics 
and Statistics, Econometrica, and The Economic Journal. She is also co-author (with 
Pepelasis and Mears) of a volume, Economic Development, Analysis and Case Studies, pub- 
lished by Harper Brothers last April. 

EDWARD AMES, 39, joined the Economics Department at Purdue as Associate 
Professor in 1954 and was appointed Professor in 1960. His previous positions include 
periods as Foreign Service Clerk, U. S. Embassy, Moscow; Instructor in Russian, Har- 
vard; and Assistant Professor of Economics, Amherst College. Ames received his Ph.D. 
in Economics from Harvard in 1952. He has written widely in the areas of money, eco- 
nomic stability, and international finance. Concerning his present article with Reiter, the 
authors state, “This paper is the result of a nickel bet between the authors as to what the 
results would look like if the calculations described here were undertaken. The bet was 
made during a weekly coffee hour while Reiter was waiting at Ames’ house for his daughter 
to finish Sunday school. There is no historic record of the exact content of the bet, so it is 
hard to say who won!” 

FRANCIS JOHN ANSCOMBE, 43, is Professor of Mathematics at Princeton Uni- 
versity. Since his recent biographical notes were published in the September, 1958 JASA 
(p. 748), he has spent the 1959-60 academic year with the Department of Statistics, 
University of Chicago, and has published several articles on statistical inference. 

ROBERT LEON BASMANN, 38, will join the staff of the University of Chicago as 
Associate Professor of Economics this fall. Meanwhile, he continues in his position with 
the General Electric Company in Santa Barbara, California. His present article is a con- 
tinuation of his study of statistical inference for simultaneous structural equations. Earlier 
work was reported in the Journal for December, 1960. Further biographical notes may 
be found on p. 752 of the December issue. 

JOSEPH BERKSON, 61, is Head of Biometry and Medical Statistics at the Mayo 
Clinie and Professor of Biometry at the University of Minnesota. Other recent articles by 
Berkson have appeared in the March, 1958 and September, 1960 issues of JAS A. Biograph- 
ical notes may be found on p. 200 of the March, 1958 issue. 

MURRAY BROWN, 31, Assistant Professor of Economics in the Wharton School, 
University of Pennsylvania, has also recently served as a Research Associate of the Com- 
mission on Money and Credit and as a Consultant to the Patent and Trademark Founda- 
tion, George Washington University. He has received a Special. Award for Effective Teach- 
ing from the University of Pennsylvania and holds a Ford Faculty Fellowship for the 
coming academic year. Before taking his Ph.D. in Economics at the New School for 
Social Research in 1956, Brown served as a Research Assistant of the National Bureau of 
Economic Research and held a Hiram J. Halle Fellowship. He also has had articles in 
Econometrica, International Economic Review, The Patent Trademark and Copyright 
Journal, Trends in Economics, and the Economic Journal. 

JACK CAPON, 29, is Senior Project Engineer at Federal Scientific Corporation, 
New York City. He assumed this position in 1959 after receiving his B.E.E. (1953) City 
College of New York, M.S. (1955) Massachusetts Institute of Technology, and Ph.D. 
(1959) Columbia University in Electrical Engineering. He served as a Research Assistant 
in the Research Laboratory of Electronics, M.I.T., 1953-5 and as an Instructor, Electrical 
Engineering Department, Columbia University, 1955-9. 

MAURICE DAVIES, 39, was trained at the Imperial College of Science and Tech- 
nology, London. He received his B.S. in Mathematics in 1948 and his M.S. and Ph.D. in 
Meteorology in 1949 and 1951, respectively. Before taking his present position as Reader 
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in Numerical Analysis at the Department of Mathematics, Battersea College of Tech- 
nology, London in 1959, he was employed as a Statistician and Mathematician by the 
British Central Electricity Generating Board. His previous articles have appeared in elec- 
trical and meteorological journals. 

CAROL BATES EDWARDS, 25, is a statistics graduate student at Iowa State 
University. She has received a B.A. in Economics from Vassar and an M.S. in Statistics 
from Iowa State. In 1955 she served as a Research Assistant at the Yerkes Observatory, 
Lake Geneva, Wisconsin. 

PETER REGINALD FISK, 32, received his B.Sc. in Economics at the London 
School of Economics in 1953. After two years as University Assistant in Statistics at the 
University of Cambridge, he moved to the University of Aberdeen as Assistant in Statis- 
tics and was appointed Lecturer in Statistics in 1955. Fisk’s previous articles have ap- 
peared in Review of Economic Studies and Econometrica. 

ZVI GRILICHES, 30, was appointed Associate Professor of Economics at the Uni- 
versity of Chicago in 1960 upon returning from a year as Research Associate with the 
National Bureau of Economic Research. Earlier notes on Griliches appear on p. 481 of 
the June, 1959 issue of the Journal in connection with his article, “The demand for fer- 
tilizer in 1954: An inter-state study.” 

JOHN GURLAND, 44, is Professor in the Mathematics Research Center, University 
of Wisconsin. From 1952-9 he was Associate Professor and, later, Professor of Statistics 
at Iowa State University. After receiving training in Mathematics and Statistics at the 
University of Toronto, Gurland took his Ph.D. in Mathematical Statistics at the Uni- 
versity of California, Berkeley in 1948. He served a year as Instructor in Mathematics at 
Harvard and three years at the University of Chicago before joining the Iowa State staff. 
He has written widely in the areas of distribution theory, estimation, and biological 
statistics. 

MAX HALPERIN, 43, is a statistician in the Advanced Development Activity, 
Knolls Atomic Power Laboratory in Schenectady. A biographical sketch appears on p. 
213, of our March, 1960 issue in connection with Halperin’s article, “Extension of the Wil- 
coxon-Mann-Whitney Test to Samples Censored at the Same Fixed Point.” 

HARMAN LEON HARTER, 42, has served as Mathematical Statistician at the 
Aeronautical Research Laboratories, Wright-Patterson Air Force Base since 1952. Bio- 
graphical notes appear on p. 752 of the December, 1960 issue in connection with Harter’s 
article, “Circular Error Probabilities,” published in that issue. 

BERNARD LAZERWITZ, 34, was appointed Assistant Professor of Sociology at 
the University of Illinois in 1960 after having served in the Survey Research Center, Uni- 
versity of Michigan since 1955. He majored in Sociology at Washington University and 
took graduate work in this subject at the University of Chicago (M.A., 1952) and the 
University of Michigan (Ph.D., 1958). Lazerwitz’ earlier publications have appeared in 
the American Sociological Review and in Social Forces. The present article developed from 
a feeling that the one Census bulletin to date on religious preference is inadequate for 
many purposes and the suggestions of Professors Angus Campbell and Leslie Kish that 
unanalyzed data in the Survey Research Center files would be worth exploiting. 

GEORGE ROY GZOWSKI LINDSEY, 41, has been Director of the Defence Sys- 
tems Analysis Group, Defence Research Board, Department of National Defence, Canada, 
since 1959. Lindsey studied mathematics and physics at the University of Toronto and 
then served in the Royal Canadian Artillery. He took graduate work in nuclear physics 
at Queen’s University (M.A., 1946) and the University of Cambridge (Ph.D., 1950) and 
joined the Defence Research Board in 1950. He is just finishing a term as Vice-President 
of the Canadian Operational Research Society. Regarding the article on p. 703, Lindsey 
states, “Long association with military problems has encouraged the author to be curious 
and irreverent when ‘well established facts that everyone has always known to be true’ 
are cited in defence of proposed courses of action. The same questions can be asked of 
baseball. Do right-handed batters really hit better against left-handed pitchers? Is the 
batting average completely satisfactory as an index of performance? Study of these prob- 
lems ensnared the author even deeper into investigation of matters discussed in the pres- 
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ent paper. Does the trailing team usually overcome the lead? Is the game never decided 
(a statistician’s ‘never,’ say «=0.02!) until the last out?”. 

JERZY NEYMAN, 67, Director of the Statistical Laboratory of the University of 
California at Berkeley. Further biographical notes appear on p. 753 of v. 55 of the Journal 
in connection with his article, “Indeterminism in Science and New Demands on Statis- 
ticians.” 

PAUL PEACH, 59, is Senior Operations Research Scientist, System Development 
Corporation, Paramus, New Jersey. Before accepting this position in 1959 he served as 
Research Associate, Case Institute of Technology (1956-9); Head of the Assessment Divi- 
sion, United States Naval Ordnance Test Station (1950-4); Associate Professor, North 
Carolina State College (1945-50); Statistician, United States War Production Board 
(1944-5); and Product Control Statistician, United States Rubber Company (1942-4). 
Peach received his A.B. in English from the University of Arizona in 1956 and his Ph.D. 
in Operations Research from Case Institute in 1961. He is the author of Industrial Statistics 
and Quality Control, Edwards and Broughton, 1947. 

JOHN WINSOR PRATT, 30, is Associate Professor of Statistics at Harvard Uni- 
versity. Since his earlier biographical notes appeared (J ASA, September, 1959, p. 697) in 
connection with a previous article, Pratt has become an Associate Editor of the Journal. 
Concerning the circumstances of his present article, he writes, “After hearing once too 
often that expected length was a poor criterion for confidence intervals, it suddenly oc- 
curred to me it was probably related to what I thought was supposed to be a good cri- 
terion. They turned out to be related, whether or not they are good criteria, and a natural 
extension tied in various examples. Very little theory of ‘optimum’ confidence procedures 
seems to exist. This hampers me, as a Bayesian, in criticizing them.” 

STANLEY REITER, 36, has been Professor of Economics and Mathematics at Pur- 
due University since 1959. He was trained at Queens College and the University of Chi- 
cago (Ph.D. in Economies, 1955) and served as a Research Associate, Cowles Commission 
for Research in Economics and as an Instructor and Research Associate, Stanford Uni- 
versity before joining the Purdue staff as an Assistant Professor in 1954. Reiter has writ- 
ten a number of articles, contributions to monographs, and technical reports on activity 
analysis, economic theory, and applications of statistics to economics. 

ELIZABETH L. SCOTT, 48, has served as Associate Professor of Statistics, Uni- 
versity of California, Berkeley since 1957. Prior to 1957, she held a variety of research 
and teaching positions at Berkeley which started with the acceptance of a Research As- 
sistantship in Statistics in 1939. Miss Scott received her Ph.D. in Statistical Astronomy 
from the University of California in 1949. Her writings have frequently appeared in more 
than a dozen scientific and statistical journals. She continues to be primarily interested in 
applications of mathematical statistics to scientific problems, particularly in astronomy, 
medicine, and meteorology. 

SHANTILAL MANGALDAS SHAH, 33, has been serving as Lecturer in Statistics 
at Gujarat University, Ahmedabad, India since 1958. Trained in Mathematics and Sta- 
tistics at Gujarat College and the University of Bombay, he worked briefly at the Sarabhai 
Chemical Works and taught at Maganlal Thakordas Balmukunddas College, Surat and 
Hargovinddas Laxmichand College of Commerce, Ahmedabad and Mafatlal Gagalbhai 
Science Institute, Ahmedabad, before taking his present position. 

RALPH THOMLINSON, 36, has served as Assistant Professor of Sociology, Los 
Angeles State College, since 1959. His earlier experience includes teaching at Denison Uni- 
versity and the University of Wisconsin as well as brief periods with the Metropolitan 
Life Insurance Company and the Bureau of Applied Social Research in New York City. 
Thomlinson studied at Oberlin and Yale before receiving his Ph.D. in Sociology from 
Columbia University in 1960. He is a staff contributor to Sociological Abstracts. 








BOOK REVIEWS 


Sample Design in Business Research. W. Edwards Deming. New York: John Wiley and 
Sons, 1960. Pp. xix, 631. $12.00. 


Lustre Kisu, Survey Research Center, University of Michigan 


yee important and well-written book should have great success with a broad 
audience. It will influence and improve the practice of sampling and will be used 
widely, especially in business research, but also in other areas of survey sampling. 
The book’s aims are clear and are achieved boldly and well: to carry to a broader 
audience the goal and the methods the author has been teaching, preaching as well 
as practicing in research and as consultant to governments, to business and to 
industry. His goal is to bring uncompromisingly the highest standards of statistical 
sampling to the farthest and darkest corners of data collection; his methods are based 
on the simplicity of his “replicated designs” and the clarity and completeness of his 
sampling instructions. The simplicity of these methods provides the necessary tools 
for achieving his goal of bringing high standards of sampling to nontechnicians. “This 
book teaches, I hope, by precept and example, necessary standards of statistical 
practice for today’s needs.” It teaches well, with a vigorous, lucid and direct style. 

The book has several distinct features and virtues, presenting new approaches to 
the field of survey sampling. First, it greatly simplifies many problems of sample 
design by casting them into the mold of his “replicated designs.” These actually 
refer to two rather different designs: one, the tenfold replication of the “Tukey plan;” 


the other, random selection of two primary units per stratum—widely used or ap- 
proximated in sampling practice. This presentation contrasts sharply with the usual 
organization of sampling theory around the random selection of a large number of 
units—a theory seldom practiced in survey sampling. 

Instead of following the usual complexities of multi-stage random selections from 
unequal clusters, the entire book is organized around simple designs for the complex 
sampling problems that need to be solved every day. 


“In respect to new methods, the chief contribution is replication by procedures 
that maintain efficiency, yet facilitate estimation of the standard error and of any 
mathematical bias that exists in the estimator used. The method is essentially 
Mahalanobis’s interpenetrating samples, which he introduced in 1936 in his surveys 
of Bengal. Acting on a suggestion from my friend Professor John W. Tukey in 1949, 
I have simplified the theory and the practice, especially for the estimates of the 
standard error, through replicated selection of sampling units by serial number in 
equal paper zones. The simple invention of selecting the final sampling units by serial 
number draws primary units with probabilities in proportion to their sizes, and erases 
the complex theory for the variance of multistage sampling.” 


As a second distinct feature, the book is organized around the excellent set of case 
studies in Part II. In Chapter 6, we select employees of a factory, then of several 
factories, first as individuals and then in groups; these problems introduce the 
tenfold replication design. In Chapters 7 and 8 and 9 that design, with variations to 
fit the specific problems, is applied very successfully to sampling business establish- 
ments, inventories and accounts. The method involves drawing ten (or 4 or k) samples 
by identical probability procedures. Separate estimates for the ten samples provide 
all the necessary information; their mean is the over-all estimate; the estimated 
variance of that estimate is one-tenth of their variance—or of their range. (This 
formula is used from p. 95 on frequently, but it is never derived and is explained only 
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briefly, with a reference on p. 437 to Mantel. This is a lapse from the author’s prac- 
tice of providing derivations for most of the other formulas used in the book.) The 
estimates presented are typically aggregates, means or ratios, but the method will be 
applied, I think, in the future to other statistics such as regression coefficients. The 
method is simple, repeating the same operation for each subsample and then for the 
entire sample; it should be well adaptable to electronic computers. 

In Chapters 10 and 11 the design of two primary selections per stratum is applied 
first to a sample of a small urban area and then to a larger area. These designs are 
rounded out in Chapter 12 with careful and detailed precedures for creating seg- 
ments and selecting individuals, presented in the clearest and most complete manner. 
Usually actual case studies either make obscure reading or appear only in watered- 
down versions to make them understandable. Deming can present his cases both 
clearly and completely because he simplified his actual designs in the first place. 

The book presents an astounding number of actual sample designs as examples and 
exercises: “Every exercise and every example in the book comes from experience. 
The examples are not all glistening examples of efficiency; they are, instead, examples 
of valid methods used under various restrictions and conditions of difficulty.” In the 
case studies the designs are laid out clearly from beginning to end as they were ac- 
tually done and a complete set of clear instructions accompanies them. These can be 
used directly to solve practical sampling problems facing the nonspecialists; ac- 
countants and auditors especially can find ready solutions for problems of sampling 
accounts, inventories, and establishments. The consulting statistician can use these 
cases to provide his clients with complete procedures and instructions. These avoid 
the passive voice, which baffles the trainee, and achieve clarity and effectiveness 
with the use of the indicative and imperative moods. 

A third feature appears in Part I, presenting “Standards of Statistical Practice” 
in five chapters and 83 pages, which should be useful both to the givers and the re- 
ceivers of statistical advice. Preaching these standards is the book’s aim; practicing 
them has made their author justly famous and has done much to raise statistical 
standards in this country and around the globe. Chapter 3 has 12 excellent and useful 
pages on “The Frame and Elements of a Sampling Plan” followed by the concept of 
the “equal complete coverage,” that Deming uses—in book and in practice—to sep- 
arate sampling errors and problems from the nonsampling sources on the survey. 
“Defense of the equal complete coverage is the responsibility of the expert in the sub- 
ject matter” and the sampler’s judgment does not extend to areas that “would be 
the same whether sampling were used or not.” Deming makes these important dis- 
tinctions very clear—although I disagree with the extreme operationalism of some 
statements, such as: “In place of the nonexistent and undefinable ‘true value,’ we now 
have the result of the equal complete coverage (p. VII)”; “In fact, the word right or 
wrong has no meaning, just as the true value has no meaning.” (p. 247) But these are 
philosophical problems on which the author, reviewer and reader have their own 
free opinions. 

There are some good examples of stating statistical inference in directly useable 
terms in nontechnical language. In dealing with statistical inference Deming pur- 
posely restricts himself to a few concrete and useful statements to avoid burdening 
his nontechnical readers with the difficulties of lengthier and more complete state- 
ments. This unfortunately leads to instances of imputing categorical truth to arbi- 
trary choices of limited applicability. For example (p. 450): “For testing . . . it may 
pay to use 2-sigma limits. But for estimation ... the 3-sigma limits answer this 
question,” with a similar statement on p. 55. On p. 462 on finding a ¢-value of 1.41, 
he says: “That is, there is no reason to assert on the basis of these figures alone that 
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one manufacturer is doing a better job than another”; and two sentences later: “How- 
ever, if I had to make a decision right now, and both manufacturers could supply me, 
and at the same price, I should choose A. This is a case of a decision without definite 
evidence that A will be the better of the 2 manufacturers, or even that one manufac- 
turer is better than another.” Statisticians will consider this an over-simplified ap- 
proach to inference, but I doubt that it will really mislead many readers; rather, it 
should lead to stimulating thought and discussion in the classroom. However, in- 
stead of presenting the reader with an over-simplified version of statistical inference, 
[ would prefer to refer him elsewhere for it. Among these references I should include 
the very recent books using neo-Bayesian inference: there’s one by Schlaifer, and 
another by Raiffa and Schlaifer. These promising methods are new discoveries for 
many of us, including Deming, who joins me in recommending them to you. 

The attempt to be both simple and complete leads to some over-simplifications 
among the many fine pieces of practical advice on statistical standards. Remember 
that this is “directed especially to the needs of the executive in business, and to the ex- 
pert in the substantive field (consumer research engineering, production, accounting, 
psychology, law, ete.) ...,” when viewing critically—as I do—these hasty words 
(p. 26): “However, economy by itself, without improved quality of information, is 
poor argument for sampling. No self-respecting executive or scientist is interested in 
trading savings for reliability.” Or (p. 13): “The statistician’s report to management 
should not talk about probabilities. It will merely give outside margins of error for 
the results of chief importance.” Then two paragraphs down: “The statistician should 
not in my judgment, give advice to the client to tell him what decision he ought to 
make.” All this hangs together with giving practically certain knowledge, using 
3-sigma limits, when the client has practically zero a priort knowledge. Often, how- 
ever, the cost of sampling and of the need for precision force us to accept narrower 
limits than 3-sigmas. Sometimes, furthermore, a priori knowledge derived from other 
sources can aid our inference. I also disagree with the hasty statement on p. 69 about 
calculating the “extreme effects of the nonresponse”: “This procedure very often 
solves the problem.” In most household sampling, refusal and nonresponse are much 
too great to permit the luxury of allowing the extreme effects of nonresponse, as 
Deming will surely agree. 

The analytical use of survey data is neglected here—as it still is elsewhere in 
sampling literature. This neglect occurs despite the opportunity (which Deming 
points out) that the “Tukey plan” of tenfold replication presents for dealing with some 
of these problems; and despite the fact that Deming wrote one of the earliest papers 
specifically calling attention to the problems of analytic studies from samples and a 
chapter on this topic for his first sampling book. The neglect may be explained by 
lack of advances in this area and the continuing preoccupation of samplers with es- 
timating means and totals. 

I disagree with a few minor aspects of this book. For a practical book, there are too 
many scattered references to and worries about the ratio bias in sampling, which is 
not an important practical problem. Also, in a few instances the author follows the 
common practice of substituting o?/n for the variance of the mean, without ‘stating 
explicitly that he is introducing the important assumption of simple random sam- 
pling, which is seldom actually used. I also regret that both examples of Chapter 13 
spread the work of each interviewer over all the subsamples so that (p. 251) “our 
estimate of the standard error is then an estimate of the pure error of sampling, not 
confounded with the differences between the interviewers, which would effect com- 
plete counts and samples alike.” Actually, Deming prefers, as I and others do, de- 
signs that include within the computed standard error the variance between inter- 
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viewers in the field and between coders and other office workers. I must, however, 
note many places of emphasis on and examples for detecting, measuring and con- 
trolling persistent and nonsampling errors. 

I disagree only weakly with the omission of the theory of systematic sampling, 
more strongly with the author’s decision to treat lightly the problems due to cluster- 
ing. He uses the “replicated designs” for dealing with sampling units without de- 
tailed attention to their clustered nature except for brief remarks and the 19 pages of 
Chapter 19. Now, I agree that the usual formal treatment of multi-stage sampling, 
without replacement at each stage, is much too formidable for practical handling 
and that the over-all variance should usually be computable simply from the totals 
for the primary selections. Further, he shows that two primary selections per stratum 
have wide usefulness and simplicity, especially in computing the variances (treated 
also in recent papers by Keyfitz and by me). This book should do much to simplify 
the practice and teaching of valid methods for practical situations. Nevertheless, as 
pedagogical strategy, I believe that the useful simplifications in this book are bought 
partly at the expense of a less thorough understanding of some important theoretical 
aspects of design. 

Let me make it clear that the methods presented yield designs that are not only 
rigorous probability samples but also highly economical. Moreover, Chapter 19 gives 
the theory for economical designs. However, I believe that this one chapter and some 
other references to economy are too brief to provide the full professional training 
needed for seizing every opportunity to economize in diverse situations. Supplemental 
reading from other texts, especially on the analyses of the components of variance 
and of cost factors for different stages of selection, will help to provide that. As in 
any other technical field, a single book can hardly serve both as the best direct prac- 
tical guide for nontechnicians and as the best theoretical treatise. I judge Deming’s 
book to be the best for the former and good, but not the best, for the latter. 

Generally, theory is placed in Part III, to serve the really important exposition in 
Part II of the practice of sampling. This will make the book less favored for the main 
text in courses of sampling techniques in Departments of Mathematics and Statistics; 
for this purpose, the organization of the book, by its very nature, is not as good as its 
individual parts, paragraphs and chapters, which are extremely well written. But 
adding some chapters from Part II would greatly increase the usefulness of those 
courses. I recommend as references for any course in sampling: Chapter 3 on “The 
Frame and Elements of the Sampling Plan,” Chapters 1 and 2 on “Standards and 
Fundamental Ideas,” the excellent Chapter 14 on “Sampling New Materials.” The 
parts on the “Propagation of the Variance” in Chapter 17 and the Poisson series in 
Chapter 18 are interesting, well-written treatments of neglected topics. 

It should prove a fine textbook for practical courses on sampling, particularly in 
business research. The teacher should introduce all along, as Deming suggests, proper 
portions of Part III, of which Chapter 17 on the “Theory of Variances” is the largest 
and most important. Chapter 15, also long and complete, deals with “Stratified 
Sampling.” He should also introduce, I think, a chapter or two from another book to 
treat the theory of clustered and multi-stage samples. 

The amount of useful information in this volume about actual complex sampling 
situations testifies to the energy of the author, who leads a busy and fruitful life. In 
this light and in light of the wealth of material and examples, the extent of errata 
(of which the author has a list) does not seem large. Here follows a complete list of 
corrections of important errors found by either the author or me, preceded by their 
page numbers. 

104: change 700 to 70; 127: change 500 to 50 and 200 to 2,000; 176: change dwellings 
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for i=2 from 6 and 7 to 7 and 6, and make dependent alterations; 178-179: change 
7? to R* and correct z and zi in Table 7: 182: in Table 9 change 84 to 95, 80 to 53, 
23 to 50, .023 to .021 and 0.475 to .0437; 219: change 7200 to 12,497; 272-273: 
change limits to “as k-«”; 319: change 2; to #; and omit part b; 328-329: change 
steps 6 and 9; 330: change steps 8-10; 370: interchange a? and 6? in (33); 377: change 
“uncorrelated” to “correlated” in remark 9; 391: ¢?=n—[f2o02+f7o/7+2f,f,c.0, | 
=n [f2aC2+f7b°C,?+2f.fyabC.,|; 394: in Table 1, in second function of =n 
[k*a°C2+9°b°C,2+2gkabC.,|, in fourth function put n in denominator of both bias 
and variance, in last function use C*,/g; 423: (i.e., if the correlation between the 
populations a; and 6; is high and the regression goes through the origin); 434: in last 
three equations, change minus terms to plus; 439: in last paragraph change “without” 
to “with”; 442: change eq. 26 to Var 2(Ar1+ 22); 443: last brace of proof should be 
squared; 460: in eq. 11 and 12 change powers of e from r, s and ¢t to m, n and (m+n); 
473: change ow to o»?; 483: “We see now that the variance of a sampling plan... . ” 

No library, private or public, on sampling will be complete without this book. 
How about the person who wants to own only one book on sampling? If he wants a 
single book from which he can obtain complete guidance and instructions on most 
practical problems of survey sampling, Deming’s book is perhaps the best. But if he 
wants a single book to study the theory of sampling in a systematic manner, then I 
think one of four other textbooks on sampling remains preferable. 

It is fascinating that each of the existing sampling books is so clearly flavored by 
the individual experiences of the authors; here theory closely follows practice. Specif- 
ically, Deming has been providing sampling consultations to nonspecialists for many 
years and his book is geared to provide good samples with easy procedures and with 
clear instructions. He has in mind chiefly the many people in all pursuits, but es- 
pecially in business research, in accounting, engineering, etc., who must draw samples, 
without becoming specialists in sampling. His simple methods will yield rigorously 
valid probability samples in complex practical survey situations. The short-cutting, 
for simplicity’s sake, of some complex theory, may lead the nontechnical reader to 
spend some extra dollars on shoe leather, but also to reliable yet simple procedures— 
hence likely either to greater savings in statistical fees or to vaster savings in avoiding 
blunders. I agree wholeheartedly with Deming’s aims to “satisfice” rather than to 
optimize in this book and I think he succeeds extremely well. 

The methods Deming advocates are well based in the universally accepted prin- 
ciples of sampling. His drastic simplifications—both of sampling methods and of 
their teaching—will stimulate healthy arguments (making the reviewer’s task dif- 
ficult). Boldly cutting a new path, this will be a much-discussed and a very influential 
book. 


Mathematical Methods in the Theory of Queueing. A. Y. Khintchine. (Translated by 
D. M. Andrews and M. H. Quenouille) New York: Hafner Publishing Company, 1960. 
Pp. 120. $5.50. Paper. 


EMANUEL Parzen, Stanford University 


HE aim of this delightfully written book is “to acquaint the reader with the main 
f pres methods, and different ways of thought which govern the application of 
the theory of probability to the theory of mass service.” The book is thus intended as 
an introduction to, and not as a complete treatment of, queueing theory. It is written 
in the terminology of telephone traffic: calls represent demands for service, con- 
versation length represents service time, and so on. The book is divided into three 
parts. 
Part I (47 pages), entitled “The incoming stream of calls,” studies the probability 
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law of the number of calls z(t) arriving during the period (0, t). Two cases are con- 
sidered: (i) when the number of calls in non-overlapping intervals are independent, 
and (ii) when the intervals between calls are independent identically distributed 
random variables. Khintchine calls the stochastic process { z(t), t>0} a stream. By 
means of several different axiomatic derivations, simple streams (the author’s term 
for Poisson processes) are shown to be realistic models for the incoming stream of 
calls. 

Part II (40 pages) considers “Systems with losses” while Part III (33 pages) con- 
siders “Systems allowing delay.” In both parts, it is assumed that (i) there are 
n(< ) lines (servers), (ii) incoming calls form a simple stream (Poisson process), 
and (iii) the length of conversations are assumed to be exponentially distributed 
(except that in the case of a single server system with waiting Khintchine also con- 
siders other service time distributions). 

In Part {I it is assumed that if there are no free servers a call is lost; the natural 
question to be investigated is the probability that a call is lost (refused). In Part III, 
it is assumed that a call arriving when all lines are busy waits until a line becomes 
free; the natural question to be investigated is the distribution of the waiting time 
for service of an incoming call. 

The results given are all derived by the most elegant methods available. Khintchine 
has explicitly sought to derive in a simple manner the pioneering results of A. K. 
Erlang and K. Palm. In the preface Khintchine writes, “I have tried to make the 
book accessible to anyone who has mastered the main concepts of the theory of 
probability and followed some course, however short, in mathematical analysis. In 
many instances the explanation could have been abbreviated if, instead of introduc- 
ing the full proof of this or that assertion, I had allowed myself to appeal to the results 
of an appropriate theory of probability (Markov chains, random processes, ergodic 
theories). However, wishing to produce a book accessible to as wide as possible a 
circle of readers, I have hardly ever yielded to such a temptation.” 

I heartily recommend this lucid 110-page work to anyone interested in the theory or 
applications of stochastic processes. 


On the Use of Markov Processes in One-Server Waiting-Time Problems and Renewal 
Theory. J. Th. Runnenburg. Amsterdam, The Netherlands (Doctoral thesis at the Uni- 
versity of Amsterdam), 1960. Pp. 139. $3.00. Paper. 


Aveustus J. Fasens, Dartmouth College 


BE classical literature of queueing theory studies models with the intervals be- 

tween the arrivals of successive customers being independent and identically 
distributed positive random variables, i.e., the input being a renewal process. Al- 
though many ergodic theorems for dependent stochastic processes have been avail- 
able for years, the renewal theoretic results (e.g. Blackwell’s theorem) for a sum of 
dependent random variables have not, and these are essential to the study of queue- 
ing models which realistically handle interdependent inputs. 

In this book (which is his doctoral thesis at Amsterdam) Runnenburg begins filling 
a serious gap by obtaining some limiting distributions for a “Markov dependent” 
renewal-type process and showing the existence of limiting distributions for two very 
interesting queueing models. 

After an introductory chapter, the author defines a “dependent arrival-interval” 
queue in which each :nterval between arrivals cf successive customers is allowed to 
depend upon the preceding interval, and a “scheduled arrivals” queue in which 
customers arrive at times 


Mu+Um, m=0, 1 2,,- 
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where yu is a constant and the u,, are successive states of a positive, numerical-valued 
Markov process. (No one is ever early for an appointment!) Both models are one- 
server queues with arbitrary service distributions. It is then shown for each model 
that the limiting distribution of the waiting time, w,, of the nth customer, 


lim Pr{w, < w}, 
no 


exists, under suitable conditions on the interarrival and service distributions. 
In Chapter III, the author studies the Markov-dependent renewal process: a re- 
newal process 
Yan = te++ +Yn 
modified by permitting y;,, to depend on y;. For the traditional renewal process, in 


which the y; are independent with common distribution A(y), three quantities which 
have been studied are “the renewal quantity,” 


Ui) = > A (y), 


where the superscript indicates convolution of A with itself, and the “excess,” v(t), 
and the “shortage,” u(t), which are, respectively, the difference between ¢t and the 
next larger and next smaller values of Y,. Among the results known for these quan- 
tities are the limit theorems: as t— ~, 


Priu(t) <2} > Pr{v(t) <2} > fa ~ Aly) Jay, 


U(t) 1 
ar in — EW (Feller), 


and A 
Uti+h) —U® "Ew (Blackwell). 

The author gives several sets of conditions sufficient to guarantee that these results 
all carry over to the Markov-dependent case. These results are applied to show that, 
in the dependent arrival-interval queue, the virtual waiting time w(t) (the waiting 
time which would have been experienced by a customer arriving at t, had one arrived) 
and w°(t), the waiting time of the last customer to arrive before t, both have limiting 
distributions as t>. 

In a fourth chapter, the author examines Pollaczek’s methods of determining 
waiting time distributions by the solution of integral equations and applies them to 
obtain distributions for some specific examples of his queueing models. 

The author’s style and the organization of the book cannot be recommended for 
clarity. An occasional Remark is helpful in relating the individual steps to the goal, 
but the whole is very hard going. The results are important enough, however, to 
make the struggle worthwhile. 


Some Aspects of Statistical Inference with Applications to Sample Survey Theory. 
Christer Weibull. Gothenburg, Sweden: (Publication number 7 of the Statistical Institute 
of the University of Gothenburg), 1960. Pp. 87. 15 Swedish crowns. 


Leonarp J. SavaGe, University of Michigan 


nis booklet divides roughly into a historical and critical essay on statistical in- 

ference and some remarks on sample survey’s connected with the author’s position 
on inference, which can perhaps be best described succinctly as midway between that 
of Fisher and that of the Neyman-Pearson school. 
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The most striking point concerns a snare (which would not catch a Bayesian) 
somewhat reminiscent of the topics of ancillary statistics and unlucky random 
samples. The common moral is the distinction between the precision an experiment 
should be expected to deliver and the precision it does deliver. 

Suppose a sample of n individuals 7 is drawn from a population of N individuals 
(which may well be clusters) according to known probabilities p;. (To keep to es- 
sentials, Weibull emphasizes sampling with replacement.) Let some quantity z; be 
observed for each individual and consider the problem of estimating x, the population 
total of z;. Plainly 2, the sample average of z;/p; is an unbiased estimate of z, and 
the variance of £ is 


N 2 
Var (2) so ~—s). 
n Pi 


Finally s*, the following sum over the sample, is an unbiased estimate of Var (2) 


‘Poenen Rog. (= ) 
. me) Ps “y* 

Such a sampling scheme is tempting, if, as is sometimes the case in cluster sam- 
pling, one can reasonably hope so to choose the p; as to render Var (#) comparatively 
small (or, more generally, comparatively small per unit of expenditure). But Weibull 
raises an important question about the interpretation of the data. Suppose, to ac- 
centuate the point without losing anything essential, that we literally knew 
o = Var'/*(#). Then £+ 1.960 would certainly be a 95% confidence interval for z. This 
interval might, however, inspire rather little confidence if, for example, the sample 
of p;’s happened to run unusually smali, especially if (as is realistic) we had chosen 
pi in a way that we hoped was correlated with z;. Indeed if the sample p,’s were no- 
ticed to be small, many of us might refrain from proceeding with the actual exami- 
nation of the z,’s dictated by the sample. 

I agree with Weibull that the difficulty is a real one. He does not pretend to say 
the last word on it, and surely neither does this review. Some publications related to 
this point are: Roe Goodman and Leslie Kish, “Controlled selection—a technique in 
probability samplings,” Journal of the American Statistical Association, 45 (1950), 
350-72. Howard L. Jones, “Inadmissible samples and confidence limits.” Journal of 
the American Statistical Association, 53 (1958), 482-90. Frank Yates, “Bases logiques 
de la planification des expériences,” Annales de l’Institut Henri Poincaré, 12 (1951), 
97-112. Frank Yates, “Quelques dévelopments modernes dans la planification des 
expériences,” Annales de l’ Institut Henri Poincaré, 12 (1951), 113-30. 


Historical Statistics of the United States: Colonial Times to 1957. United States Bureau 
of the Census with the Cooperation of the Social Science Research Council. Washington, 
D. C.: Government Printing Office, 1960. Pp. xi, 789. $6.00. 


Lance E. Davis, Purdue University 


teen reader should not be confused by the similarity between the title of this book 
and the earlier Census Bureau publication, Historical Statistics of the United 
States: 1789-1945. The new volume is much more than a mere updating of the earlier 
work. The new Historical Statistics has been so expanded and extended that it repre- 
sents a major new contribution to the study of the American past. The Census Bureau 
broke new paths in 1949 when they decided to publish an extensive collection of 
important historical series; however, the extent of their pioneering was limited by the 
resources available to them. In the new work, they have exploited their previous 
breakthrough and presented a magnificent collection of important data. The earlier 
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volume did much to ease the task of the empirically minded social scientist, but the 
present volume is an absolute necessity for any scholar with such a bent. 

It would be impossible, in such a short space, to review completely the contents 
of this new volume. Instead, I will limit myself to a few general comments on the 
differences between the new work and its predecessor. It would not be wrong to say 
that the new volume is merely an expansion of the old; however, since it represents 
an expansion in three different dimensions, such a statement would certainly be mis- 
leading. The new volume covers a longer period of history; it encompasses a much 
greater range of subjects; and it draws on a much wider variety of sources than did 
the older work. The 1949 publication covered only the period 1789-1945. The new 
volume not only brings the termination date forward to 1957, but, more important, 
also pushes the coverage back through the Confederation and Colonial periods. The 
almost encyclopedic coverage of the first volume has been even further broadened 
to include statistics on consumer expenditure patterns, social security costs and 
revenues, education, crime and its correction, recreation, religion, climate, communi- 
cations, the distribution and service industries, the size and characteristics of the 
business population, the size and distribution of corporate assets, research and devel- 
opment expenditures, the extent of private insurance, and the armed forces and 
veterans’ affairs. Finally, the earlier publication was largely limited to series generated 
by agencies of the government, but the new work has been extended to include the 
published work of private scholars and the unpublished work of certain private 
agencies (e.g., the National Bureau of Economic Research). The result is a volume 
that brings together statistical data that the researcher could spend years in attempt- 
ing to accumulate for himself. 

Since spave precludes a careful enumeration of the changes between this volume 
and its predecessor, two examples may serve te make the reader better aware of the 
increased scope of the new volume. In the case of the balance of payments data, for 
example, the earlier work has relied entirely on the work of Bullock, Williams, and 
Tucker for its estimates in the period before 1918. The new volume includes also 
the findings of Douglass North, Matthew Simon, and Paul Dickens. The work of 
North and Simon is relatively new, and was not available when the first volume was 
published. However, Dickens’ findings were available (as a thesis), but were not in- 
cluded because the Bureau’s resources were inadequate to bring together all of the 
available work. The result of these inclusions is an expansion of the balance of pay- 
ments estimates from a four-period analysis to an annual series, and an extension of 
the coverage from 1850 back to 1820, and a further partial extension to 1790. 

Again, in the section “Money Rates and Security Markets,” the new volume repre- 
sents a much expanded product. The series on municipal bond yields is pushed back 
from 1919 to 1900, that on preferred stocks from 1919 to 1910, and that on common 
stocks (from Alfred Cowles) is supplemented by an independent series from Moody’s. 
In addition, new series on commercial paper and bankers’ acceptances, margin re- 
quirements, stock market credit, and brokers’ loans by group of lenders have been 
added. 

The only criticism I find of the new Historical Statistics, and this is a minor matter, 
indeed, involves the occasional loose use of headings, coupled with a format that 
divorces source footnotes from the data. While the careful researcher should ex- 
perience no difficulties, the casual user may be led astray. For example, series Z336 
bears the title “Index of Wholesale Prices Estimated for the United States: 1720- 
1789” and that series appears to indicate a state of hyper-inflation during the Revolu- 
tionary War (an increase in the index from 78.0 in 1776 to 10,544.1 in 1780—1850- 
59=100). The source footnotes (located eighteen pages away) indicate that the 
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entire series represents only an average of transactions in three cities (New York, 
Philadelphia, and Washington) and that for the war years it includes only a single 
city (Philadelphia). 

Despite this petty criticism, the new Historical Statistics is one of the most impres- 
sive contributions to historical knowledge of the Twentieth Century. The Bureau of 
the Census and its advisory committee under Professor G. Heberton Evans should 
be congratulated for producing a volume that will serve as an anchor for further 
research for decades to come. 


Government Publications and Their Use. Revised Edition. Laurence F. Schmeckebier and 
Roy B. Eastin. Washington, D. C.: The Brookings Institution, 1961. Pp. xi, 476. $6.00. 


Harry VENNEMAN, Bureau of the Budget 


IBRARIANS will welcome the first revision since 1939 of this reference tool, which 

brings up-to-date the descriptions of basic guides to government publications 

and other features which have made the volume a classic in its field since it, was first 
published in 1936. 

Although substantial portions of the text remain unchanged from the previous 
edition, two entirely new chapters have been introduced, dealing with periodicals 
and microfacsimile editions of government publications, and significant new materials 
have been added in every chapter. The chapter on bibliographies, for example, has 
been expanded by about 50 per cent, largely by the addition of new sections on topics 
which have acquired prominence in national affairs in the last two decades, including 
geriatrics and nuclear science. 

The new edition will prove no less disappointing than previous editions to users of 
government statistics who continue to hope for an up-to-date guide comparable to 
Dr. Schmeckebier’s monumental study of The Statistical Work of the National Govern- 
ment, published by the Institute for Government Research in 1925. Statistical pub- 
lications are not dealt with as such, as the volume is not intended to serve as a sub- 
ject guide or catalog and is not organized in terms of subject matter. It may be 
regretted, nevertheless, that the chapter on bibliographies fails to mention the very 
useful “Bibliography of Sources of Statistics” which is included in every issue of the 
Statistical Abstract of the United States, or the “Bibliography of Principal Periodical 
Statistical Publications” which appears in the Budget Bureau’s booklet on Statistical 
Services of the United States Government. 

The usefulness of this volume for the purposes it is intended to serve could have 
been greatly enhanced by a more detailed analytical index. 


Guide to U. S. Government Statistics. John L. Andriot. Arlington, Virginia: Documents 
Index, 1961. Pp. 402. $15.00. 


Joun I. Grirrin, The City College of New York 


H1s is the third edition of a bibliographical guide to publications of the Federal 
Government containing statistical data. It is the purpose of the guide to bring 
order out of the voluminous output of the U. 8. Government, which is by far the 
largest publisher of statistical data in the country. The guide covers over 2,000 
items, including the recurring publications, the statistical series, and the important 
one-time publications of statistical value. As such, it should prove of considerable as- 
sistance to college and business libraires, even if these libraries do not maintain a 
large government document collection. 
The guide consists of two parts, the guide itself arranged by departments and 
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agencies and a detailed subject index. The guide contains a section devoted to each 
of the major departments, for example, the Department of Agriculture listing covers 
95 pages and includes 431 items; the Department of Commerce listing covers 75 
pages and includes 408 items. One feature of the department and agency listing, 
which may prove of value, is a series of organization charts showing the structure of 
the departments. In addition, anticipated publication programs for the reports of 
the 1960 Census are included, together with a summary of the inquiries included in 
each population census. The subject index published in the guide is detailed, with 
each entry coded to show the frequency and type of data which are available for 
each topic. For example, by consulting the subject index under “Dusts and Dusting” 
it is learned that the Agricultural Research Service published a series of value statis- 
tics on an annual basis on spraying and dusting on farms. Reference to the entry on 
“Pilot Balloon Data” shows that an annual and monthly series on data obtained by 
means of pilot balloons is published in the National Summary of Climatological Data, 
issued by the Weather Bureau. 

For each entry contained in the guide, the following bibliographical information 
is included: frequency of publication, availability (by purchase or upon request), 
price if priced, Superintendent of Documents Classification Number, Superintendent 
of Documents Depository Item Number (if applicable), and “keep” or “discard” 
notations. The Superintendent of Documents numbers are of value particularly to 
depository libraries in ordering government publications. The “keep” or “discard” 
notations are for the information of library staffs and refer to the statistical tables 
only. In general, the notation “keep” indicates that the data are not available as such 
in other publications. 

This guide fills a need which is not met by existing publications. While the Monthly 
Catalog, issued by the Superintendent of Documents, lists all government publica- 
tions, it does not separate out the statistical publications nor does it indicate the 
contents of the reports. Therefore, even libraries receiving the Monthly Catalog will 
find this guide of value. The relationship between statistical series issued under 
somewhat similar titles is clarified by this guide. For example, under the category of 
State Finances, the relationship between the data published in the Summary of 
State Government Finances and in the Compendium of State Government Finances 
is made clear. Often this type of explanatory notation can save considerable time for 
those who do not regularly consult these sources. In any guide of these proportions 
some questions may be raised. For example, item 478 County Business Patterns 
indicates that this report has been published since 1943. Was it not since 1946? The 
Uniform Crime Reports published by the Federal Bureau of Investigation (item 
1266) are no longer semiannual but are issued on an annual basis. In addition to the 
annual reports, the series of quarterly releases, which began in 1958, should be in- 
cluded. The next edition of the Guide is scheduled for publication in 1963. 


La Distribuzione dei Redditi. Francesco Brambilla. Milano, Italy: Presso 1’ Universita 
Commerciale L. Bocconi, 1960. Pp. viii, 298. No price listed, paper. 


Paut G. Cuark, Williams College 


- BramBita has provided for Italian-reading statisticians and economists 
a comprehensive review of the vast literature on the distribution of income, pulled 
together by his critical commentary and his individual explorations in the field. His 
survey ranges from Pareto to Arrow and from Gini to Champernowne, and covers 
such diverse topics as data sources, statistical measures of income inequality, dis- 
tributions as stochastic processes, income-consumption relationships, social welfare 
functions, and the marginal utility of income in the face of risk. The emphasis in 
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treating these topics is theoretical and mathematical rather than empirical, though 
there are many tables of illustrative and background data. Throughout the author’s 
exposition is systematic and lucid, and simply as a review his book (though lacking 
an index) will doubtless become a stand-by for Italian graduate students in the years 
to come. 

Particular emphasis is given, not surprisingly, to the work of Pareto and its 
seminal influences in the study of income distribution. (In the bibliography, Pareto’s 
writings are listed first, with roman numerals, followed by other authors in alpha- 
betical order, with arabic numerals!) The question of possible invariance in income 
distributions is a recurring theme. Empirically, Prof. Brambilla’s position is clear: 
statistical proofs of Pareto’s law do not exist, and Pareto’s measure of income in- 
equality gives a deceptive appearance of similarity to distributions which are actually 
diverse. Theoretically, he remains attracted by the idea of a stable income structure, 
and his most distinctive personal contribution is a further exploration of this idea. 

He proposes a model in which the distribution of income is a function of the dis- 
tribution of individual “capacities” and of a diffusion law expressing the economic 
structure. The capacities are expressed as multiplicative combinations of elementary 
aptitudes, in a form which yields a symmetrical but non-normal distribution. In a 
non-stochastic variant of the diffusion law, income (x) is assumed to depend on 
capacity (A, varying from 0 to 1) via an expression of the form z=(A)1/k/(1—A). 
The author incidentally suggests that reduced income inequality in advanced in- 
dustrial societies compared to pre-industrial societies can be interpreted simply as a 
change from 0<_K<1 to K >1.) The resulting income distribution is then derived as 
a log-normal distribution, which the author indicates has frequently been observed 
empirically. Alternatively, the diffusion law may assume that income derives from 
capacity via a specified stochastic process, and a log-normal distribution is also 
shown to be approached as a limit. 


Personal Deductions in the Federal Income Tax. C. Harry Kahn. (National Bureau of 
Economic Research Fiscal Study No. 6.) Princeton, New Jersey: Princeton University 
Press, 1960. Pp. xxix, 237. $5.00. 


Income Tax Fxemptions (An Analysis of the Effects of Personal Exemptions on the 
Income Tax Structure). Michael E. Levy. Amsterdam, Holland: North Holland Publishing 
Company, 1960. Pp. 148. No price listed. 


Amotz Morag, Hebrew University, Jerusalem, Israel 


rn. Kann’s monograph, which belongs to a series of important fiscal studies by 

the National Bureau of Economic Research, is a survey of the historical de- 
velopment of the personal expense allowances in the Federal Income Tax, with special 
attention to changes in their composition, cyclical sensitivity and distribution among 
income groups. There are special chapters on philanthropic contributions, non-business 
tax payments, personal interest payments, medical expenses and casualty losses, and 
on the standard deduction. The study includes an estimate of the quantitative im- 
portance of the personal allowances, their cost in terms of revenues, and their influ- 
ence on the tax structure. The basic conclusions are that on the whole personal de- 
ductions rose from an estimated 8 per cent of reported incomes in 1918-1921 to 13 
per cent in 1953-1956. At the same time there was a considerable change in the com- 
position of these deductions, mainly a striking decline in the relative importance of 
itemized deductions for personal interest payments. The ratio of deductions to re- 
ported incomes in different income brackets was relatively stable in 1956, although 
this stability was achieved by the combination of the declining-with-income rate of 
deductions for medical expenses, interest payments, and the standard deduction, 
and the rising-with-income rate of deduction for philanthropic contributions. The 
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percentage of estimated deductions to total expenditures involved was, in 1954, from 
81-89 per cent for philanthropic expenditures, 41.5 per cent for non-business tax 
payments, 50.9 per cent for personal interest, and 28.1 per cent for medical expen- 
ditures. The loss of revenues due to deductions was about 7 billion dollars, in 1953, 
or just about one-quarter of the actual yield in that year. Mr. Kahn states that the 
same yield could have been achieved, were there no personal deductions, by reducing 
all nominal rates of the tax by 5 percentage points. He does not suggest that this be 
done, for not only may some personal deductions be justified on grounds of ability-to- 
pay-equity, but they may also be clear and legitimate substitutes for additional 
government expenditures. What should be done is just to reconsider in each case 
whether this is the most efficient way to achieve a given goal. 

While the Kahn study is rich with empirical findings, Mr. Levy’s is an example of 
an almost purely deductive work. The subject is that of the income tax exemptions, 
which, in the U.S.A. in 1956, were more than twice as big as the personal deductions. 
Since the exemptions are responsible for much of the effective progression of the 
U. 8. individual income tax, they are certainly worthy of detailed discussions of the 
kind undertaken by Levy. 

Special attention is paid to a comparison between the effects of changes in the size 
of continuing (i.e., ordinary) exemption and the effects of changes in the marginal 
rate structure on incentives to work, coverage of income tax payers, average tax 
rate, and average rate progression. The approach is analytical, with mathematical 
appendices to all chapters, and although the conclusions are not actually surprising, 
they are useful in indicating some pitfalls in a more intuitive approach. 


Trends in the American Economy in the Nineteenth Century. Studies in Income and 
Wealth, Volume Twenty-four. National Bureau of Economic Research. (William N. 
Parker, Editor.) Princeton, New Jersey: Princeton University Press, 1960. Pp. xi, 780. 
$15.00. 


Aurrep H. Conran, Harvard University 


H°*” many generations have passed since the great methodenstreit which ended in 
an upsurge of the historical sense in economic thinking? After that, implacably, 
came statistics and econometrics. And now, after a remarkably short period in which 
a few voices urged measurement—if only of a few basic aggregates—upon the his- 
torians, we find a new methodenfest where once the methodenstreit raged. Economic 
historians rarely come together these days without reassuring themselves of the 
usefulness of statistical records. The present volume is the outgrowth of one such 
meeting. The eighteen papers published here comprised most of the program of a 
National Bureau of Economic Research Conference on Research in Income and 
Wealth in 1957. 

The papers are all attempts to extend the evidence or fill in the lacunae of em- 
pirical knowledge about economic growth in nineteenth-century North America. 
They are variously successful, ranging downward from the heroic—though none of 
them could properly be called pedestrian by anyone even passingly familiar with the 
data from which they began. There are no statistical innovations to report, but there 
is considerable originality and some ingenuity in the search for and the handling of 
blocks and patches of discontinuous numerical evidence. Economic analyses have 
been kept to a minimum and there are neither new explanations offered nor old ones 
put to the test of statistical confrontation. (There are a number of confrontations of 
old numerical assertions, however.) 

Some of the richest and most successful of the studies, like Gallman’s estimate of 
commodity output by industry after 1839, Easterlin’s interregional comparisons, and 
Budd’s analysis of factor shares in private national income, do not go much beyond 
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pointing out simple institutional shifts by way of explanation. Lebergott’s ingenious 
(and sometimes daring) construction of wage trends over the whole century contains 
more interpretation and inference than most of the other studies. His economies is 
simple, running largely in terms of the effect of immigration upon wages and some 
inferences about market imperfections; but his explanations are often speculations 
about what the trends themselves would imply. In one of the most thoughtful and 
interesting essays, O. J. Firestone offers some new interpretations of Canada’s de- 
velopment in the last half of the century. His description of statistical procedures is 
thin, however, and from these descriptions it seemed that his deflation and extrac- 
tion of values-added were extremely rough, even granting the quality of the data. 
There is more economic explanation in the excellent section on Balance of Payments 
(by North and Simon for the U. S., and Penelope Hartland and Firestone, again, 
for Canada). This is probably the most conclusive part of the book, in the sense that 
the statistical series have been brought to the point where they can carry a significant 
weight of interpretation. 

Statistical maturity in studies of the exploratory types which make up this volume 
is to be inferred from the care with which value series are deflated and the success 
with which values-added are isolated. Consistency and explicitness are more import- 
ant here than the introduction of sophisticated techniques. In general, the volume 
rates relatively high on these counts. My criticisms may seem churlish, therefore, 
especially to those familiar with the ground these authors have had to cover; I offer 
it, however, in the spirit of praise, an acknowledgement of the seriousness of the 
tasks they have taken on. In the paper by Towne and Rasmussen, on agricultural 
gross product and investment, the pricing procedures are weak and there is a dis- 
concerting amount of dependence upon extrapolation back from the data for 1870- 
1900. In Budd’s very useful revision of W. I. King’s series, much depends on one’s 
willingness to accept his particular arbitrary imputations to proprietors and slaves. 
(Some arbitrary assumption is necessary, of course.) Ethel D. Hoover has done an 
excellent revision of the consumer price series. In addition to the index she offers, her 
description of her techniques and presentation of the raw materials are extremely 
useful. And she points out that using the retail price index to get a consistent volume 
index would require reweighting, at least for a series of short sub-periods within the 
whole. I have already made grateful use of Anna Jacobson Schwartz’s gross divi- 
dends and interest payments findings, but I wish she had seen fit to estimate yields 
and interest rates on a more systematic basis from her data. 

Finally, I think I have not seen a better set of invited comments than are found 
in these proceedings. The discussions of the Wicker and Cranmer papers, especially, 
by G. R. Taylor and H. H. Segal, respectively, complement and extend their in- 
teresting findings. In addition to his own remarkable piece of detective work, with 
Franklee Whartenby, William Parker did an admirable job of editing such a varie- 
gated volume. At risk of sounding querulous, let me record the wish that he had 
given us a more extensive subject index. 


Trends in Government Financing. Morris A. Copeland. (National Bureau of Economic 
Research, Studies in Capital Formation and Financing, No. 7.) Princeton, New Jersey: 
Princeton University Press, 1961. Pp. xxvi, 210. $5.00. 


Kennetu D. Roossz, Michigan State University, Oakland 


ROFESSOR COPELAND has made an intensive study of trends in indebtedness of the 
federal government primarily since 1890 and of the state and local governments 
primarily since 1910. The study is part of the larger investigation being conducted 
by the National Bureau of Economie Research of trends and prospects for capital 
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formation and financing. Because of the difficulties encountered in identifying real 
capital formation of government units, Copeland concentrates upon their financial 
requirements. Some of his major conclusions are: that through 1954, more than 90 
per cent of the past federal debt resulted from war emergencies and most of the 
remainder from the depression generated expenditures of the 1930’s; that the pro- 
portion of wartime expenditures financed by taxation has increased through each 
successive war, largely due to improved fiscal procedures; that the federal govern- 
ment has gradually come to assume more responsibility for countercyclical meas- 
ures of stability; and that the growth of state and local debt (and federal debt 
to a considerable extent as well) has resulted from the technological revolution and 
the growing complexities of a modern industrial society. 

By extrapolating from the experience of the past, Copeland forecasts that more 
governmental functions will be financed at the federal level. He also thinks it is likely 
that the federal net debt will increase over the course of future business cycles as 
federal policies increasingly are directed toward business stabilization. In his opinion, 
the actual growth pattern of the public debt will depend largely upon international 
developments and the frequency of depressions and recessions. 

In appraising the impact of government financing upon the financial markets, 
Copeland employs the governmental deficit concept in which “Either the excess of 
nonfinancial expenditures over nonfinancial receipts or the increase in net debt meas- 
ures the financial capital requirement” (p. 25). Under this definition, net additions 
to assets of government corporations and trust funds are subtracted from the in- 
creases in government gross debt in order to measure the direct demand of the 
government upon the loan fund markets. Thus, Copeland develops a concept that is 
pertinent to the objective of economic stabilization. In a note to Chapter II, he then 
reconciles this Flow of Funds nonfinancial computation with three other measure- 
ments of government surplus: the budget; the net cash operating income; and the 
national income and product accounts. 

A criticism of the presentation of the findings is that occasionally the data in the 
text are inconsistent with those in the tables. For example, in the text, state and local 
indebtedness, net of sinking funds, was placed at $926 million in 1890, yet Table I 
shows $1,140 million for the same date (p. 8). Also according to the text, state debts, 
net of sinking funds, declined to $211 million in 1890 but Table I shows $230 million. 
(p. 8) Other similar examples might be given. Confusion arises, too, when tables give 
different statistics for the same measurements as in some of the data in Tables 1 and 
24. A minor criticism of a different sort is that Copeland does not identify the initials 
FOF when introducing the FOF nonfinancial computation of the government defi- 
cit. It is doubtful that FOF is so widely understood to be Flow of Funds that the 
abbreviation requires no explanation. 

Two conclusions relating to compensatory fiscal policy are somewhat misleading. 
Copeland overemphasizes the extent to which there were two schools of thought con- 
cerning countercyclical deficit spending in the thirties. Few economists even by then 
were supporting a compensatory budget policy. A second point, more of interpretation 
than of emphasis, is that although Copeland properly characterizes the federal budg- 
ets prior to World War I as reflecting “business cycles in a passive sense only,” 
John M. Firestone in another National Bureau study, Federal Receipts and Expendi- 
tures During Business Cycles, Princeton, 1960, stresses the more significant finding 
that federal budgets nevertheless were mildly contracyclical during the recessions 
and expansions from 1879 to World War I. 

Professor Copeland’s study provides a useful addition to our knowledge of the 
historical developments in federal, state, and local finances and expenditures; his 
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forecast of a further growth in the federal debt also seems credible in view of the 
long-term increase in the role of government in the economy. 


International Trade 1959. The Contracting Parties to the General Agreement on Tariffs 
and Trade. Geneva: United Nations, 1960. Pp. 189. $2.00. Paper. 


O. J. Frrestons, University of Ottawa 


HIS is a most comprehensive and usefui survey of recent trends in international 
trade and of major trade problems faced by both the industrialized countries and 
the less-developed nations of the world. 

The title “International Trade 1959” does not do justice to the stimulating analysis 
and in part challenging forecasts and ideas contained in the report. The casual reader 
may get the impression that this is just another compendium of international trade 
statistics with some terse comments written by a recent university graduate. In- 
stead it is a carefully prepared analysis written by competent and dedicated profes- 
sionals who must believe that one way of getting governments to adopt enlightened 
international commercial policies is to state clearly the trade problems the world 
faces and to suggest approaches along which solutions to some of the problems may 
be found. 

This report “International Trade, 1959” is the seventh issue published since 1952. 
Perhaps the authors might consider adding a subtitle to the 8th issue of their report 
which would tell the interested reader that he can expect a good deal more from it 
than its title implies. One possible sub-title for “International Trade, 1960” could be 
“Progress, Prospects and Problems.” 

The 1959 report is organized in three parts. 

The first part entitled “Recent Trends in International Trade” also goes much 
further than the heading suggests. It deals not only with changes in the structure and 
pattern of world trade during 1959 and recent developments in the U. 8. balance 
of payments with their far-reaching effects on the trading partners of the United 
States, but it also covers progress in international trade, world output and the general 
rate of economic growth in the industrialized and non-industrialized countries during 
the nineteen fifties. 

The economic analysis of what happened in the field of international trade, and 
why, represents just the basis for a take-off by the authors to delve boldly and with 
ingenuity into such intricate subjects as the future prospects of international trade, 
problems of trade expansion, capital needs and import requirements for accelerated 
economic growth of the non-industrialized countries of the world. 

The second part deals with structural changes in international trade, mainly in 
terms of commodity flows and price variations. A separate analysis is provided for 
manufactured products and for primary commodities, the latter including interna- 
tional trade in agricultural products, industrial raw materials and fuels. 

The third part deals with trade problems faced by particular regions of the world, 
including Western Europe, with special emphasis on the E.E.C. (European Economic 
Community) and E.F.T.A. (European Free Trade Association), Latin America with 
a useful distinction between dollar and non-dollar countries, and the overseas sterling 
area covering both the independent dominions and the dependent territories, many 
of which are in the process of acquiring the national elation and economic headaches 
that usually follow in the wake of newly acquired political independence. 

An appendix completes the report. It contains notes on sources of statistics and 
methods of estimation together with a number of detailed tables. Of these the most 
useful is the table portraying the “network of total international trade” covering the 
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years 1938, 1948, and 1950 to 1959. Trade from area exporting to area of destination 
is shown separately for North America, Western Europe (E.E.C., E.F.T.A., total 
and U.K. separately), Japan, Latin America separately for dollar and non-dollar 
areas, overseas sterling area, overseas associates of continental Western Europe, rest 
of the world, excluding Communist countries. 

In a final column the contribution to world commerce by the U.S.S.R., other 
countries of Eastern Europe and Mainland China is recorded covering their trade 
with the rest of the world but not including international trade among the Com- 
munist countries themselves. 

This latter omission is a pity for a comparison of the growth of international trade 
among the Western nations with the volume of international trade among Commu- 
nist countries could be instructive, particularly if we knew more about the reasons for 
the different rates of expansion of international trade and their effect on economic 
growth generally and international interdependence specifically. 

Perhaps we can arouse the interest of the harassed reader in business, the univer- 
sities and in governments to examine “International Trade 1959” more fully by list- 
ing three out of a number of pertinent and thought provoking conclusions. 

1. World trade reached a new record high of $103 billion in 1959, 6 per cent above 
the level in 1958, a recession year in many countries, and 13 per cent over the previous 
peak of 1957. 

Prices of commodities traded declined by 3 per cent from 1957 to 1958, and by a 
further 2 per cent from 1958 to 1959. Hence world trade in 1959 was in volume terms 
8 per cent and 44 per cent ahead over the preceding two years, respectively. 

Two stimulating questions arise. Why are prices of export commodities declining 
when world markets are expanding at a record rate? Why would world trade grow 
at a more rapid rate than total world commodity output? The report does not 
answer these questions specifically but the reader will find plenty of material on these 
subjects enabling him to formulate his own views. 

2. Progress in world trade has been uneven in recent years. Western Europe has 
been making the most rapid progress followed by Japan, while North America has 
been losing ground. Between 1957 and 1959, exports from Western Europe rose by 
$3.1 billion, from Japan by $600 million, from the rest of the world (excluding North 
America) by close to $500 million, while exports from North America declined by 
$2.7 billion. 

This evidence suggests that North America is on the crossroads. The choice is 
either to pursue more positive and dynamic commercial policies or be prepared to 
accept a lesser role in world trade than this Continent has occupied since the end of 
World War II. 

3. The problem of economic growth facing the less developed countries is closely 
linked with their ability to obtain the capital, know-how and machinery and equip- 
ment required to create the necessary productive facilities. The report raises the 
question as to how the less developed nations will finance imports which by 1969 
may reach some $43 billion: 

“The paradox facing the low-income countries is that, while they cannot raise their 
output and real incomes without a substantial growth in imports from industrial 


countries, the industrial countries have been able to raise output and incomes with 
a diminishing proportion of imports from primary producing countries.” (p 53). 


The report concludes: exports of raw materials and agricultural products will not 
be sufficient to enable the less developed countries to pay for necessary imports. 
They must be given increased opportunities to export manufactured products, par- 
ticularly those of a less complex type, to the industrialized nations of the world. 





BOOK REVIEWS 757 


This is international economic statesmanship at its best—taking a very long point 
of view. Perhaps a subsequent issue of “International Trade” will be able to spell out 
how this desirable objective can be achieved. 

A book review customarily includes also some critical observations. The reader 
going over this report may find some important questions on international trade 
problems unanswered, some significant subjects not covered and some conclusions 
controversial. But, before formulating a too critical judgment tle reader may reflect 
and remember the limitations imposed on and the handicaps faced by international 
civil servants serving as they do many political masters with diverse national loyalties 
and economic interests. 

Perhaps two general comments may be appropriate. 

First, the very comprehensiveness of the report makes for difficult reading. The 
three different parts hang together loosely, suggesting the desirability of greater ef- 
fort towards integration of the report when the next volume is written. One possible 
way of improving integration is to analyse international trade developments on a 
subject basis: progress, past, present and prospects; problems and proposals. 

Secondly, on occasions the enthusiasm of the authors seems to get the better of them 
leaving them to blend pious hope with realistic assessment. 

In making a case for greater integration of E.E.C. and E.F.T.A. the following con- 
clusion is put forward in the report: “The two areas are therefore likely to experience 
a more rapid development than if integration had not taken place. If so, this would 
lead to higher imports of primary products from non-European countries, since for 
many primary products the possibilities for supply from European sources are quite 
limited” (pp. 154 and 155). 

In discussing the trade prospects of the non-industrialized countries, the report 
draws on an earlier study of the long term market prospects for the export of primary 
products (“International Trade, 1956”, pp. 17-35). This earlier report suggested 
“that the long-term prospects for imports of raw materials were not very favourable, 
in spite of the high rates of growth in GNP assumed to occur in the industrial countries 
during the twenty years following 1953-55.” The 1959 report offers the conclusion 
that “the growth in actual earnings derived from the exports of primary products to 
industrial markets that may be envisaged during the 1960's is likely to be relatively 
moderate” (p. 55). 

If the less developed countries of the world are not going to be able to sell increasing 
quantities of industrial raw materials and other primary products to a growing 
Western Europe and to the United States, whose natural resources are gradually 
being used up, where are these countries going to sell the bulk of their raw materials? 

Over all, “International Trade, 1959” sets a high standard fer economic reporting 
and presents an incentive for authors of other international studies to do as well or 
better. 


The Economic World Balance. Thorkil Kristensen and Associates. Copenhagen, Denmark: 
Ejnar Munksgaard, 1960. Pp. 377. 46 Dan. Kr. 


Irma ADELMAN, Stanford University 


The avowed aim of the book is to project to 1980 the economic growth of the sev- 
eral regions of the world. Unfortunately, however, the book is in fact a general treatise 
on economic development. 

The approach used for forecasting purposes is to extrapolate past trends in popula- 
tion and labor force, capital accumulation and technical change. The effects of these 
forces on the rate of growth of the net domestic product are then assessed with the aid 
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of a Cobb-Douglas production function. Excluding technical progress, the production 
function is assumed to be subject to diminishing returns in the densely populated re- 
gions of Western Europe, Eastern Europe and Asia and to constant returns elsewhere. 
The resulting forecast annual rates of growth of net domestic product in constant 
prices are 3.00—4.50 for North America, 1.75-3.25 for Western Europe, 3.50—4.50 for 
Oceania, 4.00-5.50 for the U.S.S.R., 2.50-4.00 for Eastern Europe, 3.75~—4.75 for 
Latin American, 3.25-4.25 for the Middle East, 1.75-2.75 for Asia and 2.50-3.50 for 
Africa. Thus, in the short run, inequalities between developed and underdeveloped re- 
gions are projected to increase, while disparities between East (U.S.S.R. and Eastern 
Europe) and West (North America and Western Europe) will narrow. 

As an intelligent, albeit rough, attempt at projection the book fulfills its purpose. It 
would have been desirable, however, to give the reader more information concerning 
the basis used for selecting numerical values for the parameter of the production func- 
tion of each region. From the discussion it is apparent that these parameters are 
purely in the nature of “guesstimates.” Since bench mark capital data and annual net 
investment and labor force figures are available for many countries, one wonders why 
no discernible attempts were made to derive these constants by regression analysis. 
In particular the very important rate of technical progress was estimated simply as a 
single year residual from the production function without technical progress. It there- 
fore includes the effects of all errors of measurement and errors of specification as well. 

As a general discussion of economic development, the book falls quite short of its 
mark. The treatment of various aspects of growth is uneven and the level unsophisti- 
eated. There is a deplorable tendency to bring in and expand upon irrelevant informa- 
tion. Thus, on p. 99 we are informed that, “Among the Tasmanians and Andamanese, 
dogs were not used as domestic animals, whilst the dog used by the Australiforuis, the 
dingo, is a very old and quite useful offshoot from the general and otherwise very 
varied race of domesticated dog to be found all over the world.” 

Often the authors tend to engage in brash generalization without attempting to 
either qualify or defend their position. In support of this criticism, one may offer such 
passages as the discussion of stages of growth on pp. 56-9 and the unproven assertion 
that the main reason for cultural changes “has been immigration by people from 
other cultures. The changes have seldom taken place from internal causes” (p. 213). 
This is contradicted by the contention (p. 214) that “economic and technical factors 
are far more influential in destroying a cultural pattern than any cultural factor.” For 
economists, their discussion of growth models is inadequate. Finally, the internal 
organization of subsections within the various chapters is also, at times, rather poor 
with a marked tendency to jump from one topic to another. 

All in all, while many of the deficiencies of this book could have been remedied by 
better editing, it is the opinion of this reviewer that the authors were ill advised to at- 
tempt to combine a work of this nature with a general discussion of economic develop- 
ment. 


Controlled Clinical Trials. A. Bradford Hill, Editor. (Papers delivered at the Conference 
convened by the Council for International Organizations of Medical Sciences established 
under the joint auspices of UNESCO and WHO.) Springfield, Illinois: Charles C Thomas, 
1960. Pp. vii, 177. $5.00. 


DonaLp MaINnuLanpn, New York University School of Medicine 


‘T HAVE got together a British group of leading clinicians (physicians and surgeons) 
along with a smattering of ourselves to tell them [European clinical investigators] 
of the ‘unethical’ practices we have been up to.” This was the first announcement that 
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this reviewer received regarding the Conference, held in Vienna in 1959, which re- 
sulted in this book. The sentence, contained in a letter from one of the chief organizers 
of the Conference, selected the topic that ought to be placed first in any effort to pro- 
mote the use of controlled clinical trials, and was so placed in this Conference—the 
ethical problem. 

The aim of the week’s Conference was “to build up . . . a clear picture of the British 
concept of the controlled clinical trial and the place that we believe it holds in medi- 
cine.” Among the seventeen British visitors who presented papers, the “smattering of 
ourselves” comprised six who could technically be classified as “statisticians,” while 
the rest were clinicians (or laboratory workers closely associated with clinical work); 
but this classification is somewhat artificial, because in Britain during the past dozen 
years the distinction between statisticians and clinicians, with reference to an under- 
standing of the principles of investigation and inference in clinical trials, has happily 
become rather hazy. 

In this book the principles of investigation are lucidly illustrated by detailed ex- 
amples of researches in the treatment of many kinds of disease—acute infections (the 
common cold and sore throat), pulmonary tuberculosis, rheumatoid arthritis, coro- 
nary thrombosis and cancer. Other papers deal with more general topics, such as the 
construction of comparable groups, criteria for diagnosis and assessment (with an in- 
formative discussion of observer variation), the patient as his own control, the se- 
quential approach, problems of organization, the design of records and follow-up, the 
analysis and presentation of results. Under these headings the aim was not complete 
description of methods, but an indication of principles and potentialities, for the 
benefit of an audience in Europe, where controlled trials are much less familiar to 
clinicians than they are in Britian. 

A much larger audience can now share this benefit; and any reader of this Journal 
who comes in contact with clinical trials would be well advised to read this book, not 
only for his own profit, but so that he can recommend it to any clinical colleagues who 
ask for an informative and readable introduction to the subject. In such a recommen- 
dation to a clinician there is perhaps only one warning that should be given: Let the 
reader not be beguiled by a statistician’s understatement (p. 17) that “however care- 
fully a trial has been planned, occasionally things will go wrong.” Let him rather heed 
a clinician’s words (p. 166): “I would like to emphasize that a controlled clinical trial 
is a serious matter and is not to be undertaken lightly. . . . It is important to get the 
answer right, and this means that an enormous amount of trouble has to be taken in 
the planning, execution and publication of a trial.” Those who have been deeply im- 
mersed in clinical trials, especially in multiclinic trials, find' that Murphy’s Law, 
quoted by theatrical producers and directors, is very descriptive: “If something can 
go wrong, it will.” 


I Processi Stocastici in Statistica. Giuseppe Avondo Bodino. Milano, Italy: Presso ’Uni- 
versita Commerciale L. Bocconi, 1960. Pp. 95. Price not listed, paper. 


_D. L. Burxnotper, University of Illinois 


I THis book are sketched parts of the mathematical theory of finite Markov chains, 
queueing, and stationary time series. In addition, numerical examples are worked 
out illustrating the applicability of the theory. The aim of the author is expository. 
In the discussion of finite Markov chains, some of the key theorems describing the 
asymptotic behavior of both regular and nonregular chains are proved. These results 





1 Mainland, Donald, “The Clinical Trial—Some Difficulties and Suggestions,” Journal 
of Chronic Diseases, 11 (1960), 484-96. 
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are then used to derive, for a particular example, the ultimate age distribution of a 
population of tools, the population being kept at constant size by the replacement of 
those tools which become worn-out or broken. Also, an application of Markov chain 
theory to the distribution of incomes in a population is presented. Here a countably 
infinite number of states are allowed. Although the mathematical theory of this case 
is not discussed, references are given. 

The queueing model examined assumes gamma distributed arrivals and negative 
exponentially distributed service times. The differential-difference equation for the 
process is derived and an analysis of the stationary case is made. Estimation of the un- 
knowns in the model by the method of moments and by the method of maximum 
likelihood is briefly described. No mention is made of newer approaches to estimation 
theory. 

Moving averages, autocorrelation, and spectral distribution theory are briefly in- 
vestigated in the chapter on time series. By way of illustration, an analysis is made of 
the number of rainy days per year, and the annual precipitation, in Bari, Italy, from 
1867 to 1953. 

Social scientists, industrial engineers, and the like, who have a certain degree of 
mathematical sophistication should find this an interesting and useful book. They will 
have to look elsewhere, however, for more than a fragmentary discussion of how esti- 
mation and other decision problems can best be handled in a stochastic process setting. 


Applicazioni Economiche della Teoria dei Grafici. Giuseppe Avondo Bodino. Milano, 
Italy: Presso l’Universita Commerciale L. Bocconi, 1960. Pp. 86. No price listed, paper. 


KenneETH J. Arrow, Stanford University 


His book is a brief and very clear summary of recent developments in the intersec- 

tion of linear programming and the theory of graphs (a type of combinatorial anal- 
ysis). After a brief introduction to matrices and vectors, there is an excellent sum- 
mary of the theory of graphs to the extent relevant to the subsequent analysis. On the 
basis of this, the author summarizes works of Ford, Fulkerson, and Gale on flows 
through networks, the transportation and allied problems (including the optimal as- 
signment problem, Cahn’s warehouse problem, and Orden’s trans-shipment problem) 
by means of combinatorial methods, and finally a criterion for the solvability of Leon- 
tief models due to Bott and Whitin. The exposition is extraordinarily clear and there 
is no single place to find the material he summarizes. There are no statistical problems 
raised in the book. 


Statistische Methoden der Populationsgenetik. Henri Louis Le Roy. Basel, Switzerland: 
Birkhauser Verlag, 1960. Pp. 397. Fr. 67.50. 


Newton E. Morton, University of Wisconsin 


7 first section (0) elaborates about the partitions of phenotypic variance into 
environmental and genetic (additive, dominant, epistatic), interactional, and 
correlative components. The next section (1) extends these results to correlation be- 
tween relatives, including the work of the last several years on epistatic parameters. 
The third section (2) treats analysis of variance as a basis for estimation of genetic 
and environmental components and heritability. The last section (3) covers predic- 
tions of the results of selection and the construction and use of selection indices. The 
calculation of indices by matrix algebra should be useful to agronomists and animal 
breeders, although it is curious that the conventional indirect approach through genet- 
ic variances and covariances is presented without mention of the much simpler and 
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more direct procedure of regressing economic value of progeny on parental pheno- 
types. 

To English-speaking readers the title is rather misleading. Much of the book is con- 
cerned with variance partitions that do not lend themselves to empirical investiga- 
tions, so that this is not statistics in the usual sense. Nor is it population genetics as 
the term is used in Anglo-Saxon countries, but rather a small part of what is usually 
called biometrical genetics, the part associated with variance partition. The reader 
will look in vain for treatment of population structure, balanced polymorphism, muta- 
tion, the results of regular systems of inbreeding, and a host of other important as- 
pects of population genetics. Although some arithmetical examples are given, there is 
little attempt to apply the formulae to actual observations or to discuss critically the 
assumptions involved. Even topics like diallel crosses which clearly fall within the 
area of this volume are treated sketchily or not at all. This book lacks the originality 
and elegance of Malecot, the comprehensiveness of Li, and the sound common sense 
of Mather. 

In general content, LeRoy’s book is similar to Kempthorne’s, An Introduction to 
Genetic Statistics, but is even more limited in scope. The contrast between LeRoy and 
Kempthorne may be illustrated by their treatment of human genetics. Kempthorne 
cavalierly dismisses the subject with a discussion of an old paper by Fisher on meth- 
ods of ascertainment. This is clearly inadequate, but if one were going to represent all 
of human genetics by one paper, this would not be a bad choice of the older literature. 
LeRoy, however, is even more biased in his presentation of human genetics, devoting 
almost all of this section to twin studies. When we consider that twin researchers be- 
come older and fewer with each passing year, that their assumptions are not open to 
test, and that very little has emerged from their work, this emphasis on twin studies is 
fantastic. 

He makes a brief reference te the regression analysis of height by Fisher and Gray, 
which confounds common environment with genetic factors and therefore gives a 
spuriously high heritability. There is no mention of the much more reliable and im- 
portant studies of recent years on quantitative traits in man, including inbreeding 
effects, selection, maternal influence on birth weight, isolate breaking, assortative 
mating, and many others. These have introduced important methods, and the paper 
of Wright on the heritability of intelligence by the method of path coefficients cer- 
tainly deserves discussion for its methodological interest. For those who read English, 
the books of Mather, Li, and Kempthorne are still superior to the present volume. 


An Introduction to Linear Programming and the Theory of Games. S. Vajda. New York: 
John Wiley and Sons, Inc., 1960. Pp. ix, 63. No price listed. 


Raupu E. Gomory, IBM Research Center 


r. Vaspa has already written two concise elemientary presentations of the main 

topics of linear programming and related areas. These are The Theory of Games 
and Linear Programming, Meuthen, 1956, and Readings in Linear Programming, 
Pitman, 1958, (see these reviews, JASA, 55 (1960) p. 294). The present closely re- 
lated volume was developed from a series of lectures on the same subject matter and 
contains two main parts, one on linear programming and one on game theory. The 
linear programming section (37 pages) deals first with the usual transportation prob- 
lera using the condensed simplex calculation and “shadow prices” and then moves on 
to the more general simplex method. This is explained together with the various at- 
tendant possibilities such as an infinite solution, no obvious first feasible solution, etc. 
Degeneracy and cycling are described but for an anti-cycling techniques the reader is 
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referred to Theory of Games and Linear Programming. The dual system of in- 
equalities is then introduced and it is shown that elimination or change of variables in 
the primal problem has an equivalent effect on the dual problem so that the final 
tableau obtained from the simplex method contains both the primal and dual solu- 
tions. This provides an easy proof of the duality theorem of linear programming. The 
next section which deals with games (25 pages) starts off with an explanation of the 
payoff matrix, saddle point, etc. Mixed strategies are illustrated and solved for 
graphically in the zero-sum two person case where one of the players has only two pure 
strategies. To provide a general game solving technique the game problem is con- 
verted into a linear programming problem. This conversion then provides a proof of 
the minimax theorem. 

Nonzero-sum games and many person games are touched on in passing and infinite 
games are illustrated by a Blotto game. The book ends with a separate, i.e. not linear 
programming proof of the minimax theorem. 

The exposition is mainly by way of numerical examples and as the author remarks, 
no mathematical preparation beyond secondary school level is needed. However I 
doubt that anyone with so little background would be able to follow the explanations 
which are often quite compressed. This reviewer prefers the clearer if somewhat 
longer treatments of the earlier volumes. 

The book seems to be virtually free of misprints except for an outburst of seven in 
Appendix I. 


Concentration in British Industry. R. Evely and I. M. D. Little. (The National Institute 
of Economic and Social Research Social Studies No. 16.) New York: Cambridge Univer- 
sity Press, 1960. Pp. xvi, 357. $10.00. 


InmMA ADELMAN, Stanford University 


His study offers a careful analysis and interpretation of the concentration data for 
220 industries collected by the British Board of Trade. It combines a cross section 
analysis for 1951 with a time series comparison over 1935-51. 

After a fairly brief conceptual and methodological discussion, the concentration and 
industry structure in Britain in 1951 is investigated in detail. The industrial organiza- 
tion is revealed to be similar in broad outlines to that which exists in the United 
States. It consists of comparatively few trades with strong monopolistic elements, a 
predominant proportion of oligopolistic industries with and without a competitive 
fringe, and a surprisingly large (33 per cent of firms with 25 per cent of total em- 
ployment) purely competitive sector. 

The plant structure of British industry is examined in order to see to what extent 
size difference between units are due to technological or to pecuniary economies and 
the growth pattern of the leading firms in high concentration industries is also in- 
vestigated. It is found that in the majority of high concentration trades, long term 
demand has been rising and the largest firms have grown primarily by amalgamation, 
merger, and piecemeal acquisition rather than by internal expansion. The persistence 
of high concentration in the face of increasing demand is attributed primarily to eco- 
nomic, technological and legal barriers to entry. 

Next, regression studies of concentration ratios on various independent variables 
are employed in an attempt to account for variations in concentration both between 
industries at the same point of time and within the same industry over time. The au- 
thors have a somewhat unfortunate tendency, however, to base their regression anal- 
ysis upon purely definitional identities. As a result, this portion of their work is some- 
what unsatisfying because the results do not have much theoretical significance. Fur- 
thermore, the rather disappointing nature of the time series regressions serves (as the 
authors recognize) primarily to caution the investigation from drawing inferences 
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from the cross section results. As is well known, cross section studies may appear to 
lead to more significant results than time series studies mainly because the cross sec- 
tional variances of the dependent and independent variables are larger. 

When the 1951 concentration situation is compared to that which existed in 1935, 
it appears that principal product concentration rose in 27 out of 41 industries and fell 
for the rest. Of the factors which might have been expected to contribute to this 
change on a priori grounds, World War II was found to have had no permanent im- 
pact. Technological change, on the other hand, played a very prominent role in in- 
creasing concentration, as did state action in the form of regulation of foreign trade, 
subsidization and (indirectly) nationalization. 

The analysis of concentration is rounded off by a series of 20 case studies of various 
industries. 

All in all, the authors have provided us with a very competent and valuable investi- 
gation into the market structure of British industry. While one might wish for a 
theoretically more meaningful treatment of the problem, this is less a criticism of the 
authors than of the state of the art. Despite this difficulty, however, this book is highly 
recommended for all serious students of industrial organization. 


Methods of Regional Analysis: An Introduction to Regional Science. Walter Isard. (In 
association with David F. Bramhall, Gerald A. P. Carrothers, John H. Cumberland, Leon N. 
Moses, Daniel O. Price, and Eugene W. Schooler.) New York: The Technology Press of 
the Massachusetts Institute of Technology and John Wiley and Sons, Inc., 1960. Pp. 
xxix, 784. $9.50. 


Micuakt J. BRENNAN, Brown University 


ethods of Regional Analysis is the fourth in a series of regional science studies. 

Written in conjunction with six of his students or associates, this book by Walter 
Isard is a companion to his earlier volume, Location and Space-Economy. The earlier 
study as “a general theory of location and space-economy is of little direct use in 
treating concrete problems of reality” (p. vii). The present publication purports to 
supplement the general theory “by techniques of regional analysis which are opera- 
tional”’ (p. vii). 

The book is divided into thirteen chapters, the first of which is an introduction. 
Chapters 2 and 3 describe elementary methods of population projection and migration 
estimation. Extrapolation of trends by various types of graphical and regression 
methods are explained, with perhaps unnecessary laboring over mathematical nota- 
tion even for the beginning student. And the severe shortcomings of each technique 
are pointed out. 

In Chapters 4 and 5 the emphasis switches to accounting. Chapter 4 describes re- 
gional income and social accounts in order to illustrate the importance of monetary 
magnitudes, while Chapter 5 concentrates on flow analysis and interregional balance 
of payments statements. Both chapters, especially Chapter 4, are marked by many 
conceptual and empirical problems, which render the estimates “shaky and somewhat 
fragmentary” (p. 116). 

Up to this point the book contains little, if any, theory in the sense of postulated be- 
havioral relationships. Various theoretical approaches are discussed in Chapters 6 
through 11. Chapter 6 applies the economic theory of business cycles and multipliers 
to regional analysis, concluding with the remark that each approach evaluated yields 
a very imperfect result. Chapter 7 contains the first explicit presentation of an op- 
timizing model in the comparative-cost analysis of industrial location. The most suc- 
cessful methods are found to be those most closely related to conventional economic 
theory. Input-output matrices are explained in Chapter 8 with the aid of hypothetical 
data. Though the difficulties encountered in obtaining voluminous data, in defining 
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industries and sectors, and in the restrictive assumption of constant input coefficients 
are taken into account, the authors’ overall appraisal suggests a promise of positive 
results. Industrial complex analysis—a hybrid technique that borrows from the com- 
parative cost and the input-output methods—is illustrated with empirical data on a 
single case study in Chapter 9. In Chapter 10 linear programming is applied to inter- 
regional problems, but the data is scanty and again hypothetical. The discussion is 
quite general, and it is doubtful that an operational linear programming model could 
be used to solve concrete problems unless radical alterations were made. Gravity 
models, potential models, and spatial interaction models are surveyed in Chapter 11. 
Their common characteristic is the adoption of stochastic behavioral relationships. 
Inherent in these experimental techniques at their present stage of development are 
certain conceptual difficulties, to which the discussion is limited. 

Chapters 12 and 13 conclude the study by indicating the ways in which the various 
methods might be synthesized to provide a unified approach to problems of regional 
analysis. The channels of synthesis are sketched diagrammatically, with arrows, 
boxes and sub-boxes indicating the lines along which integration might proceed. But 
any attempt at actual synthesis has been deferred to a future publication. 

In treating new disciplines a reviewer must guard against a closed mind. It is tempt- 
ing to discount the authors’ efforts simply because the analysis is not as well devel- 
oped as that found in the more established disciplines. Nor can one criticize the book 
for not attaining what it does not set out to attain. It discusses, not statistical esti- 
mates, but the methods by which estimates might be made. 

Nevertheless, to one who judges analytical tools by their empirical verification, the 
net result is not very satisfactory. First, to show that a method is operational is not in 
itself of much value. The sun-spot theory of business cycles is operational, i.e., sub- 
ject to empirical test. Secondly, aside from the absence of systematic statistical tests, 
it is questionable whether some of the methods are testable in practice. The more 
sophisticated ones escape the naiveté of those presented in the early chapters. How- 
ever, either sample data are deficient or the theory contains intangible variables on 
which no data exist. Thirdly, there appear to be as many methods as there are types 
of problems—and there are a host of problems. The synthesis of methods, all of which 
are defective in one respect or another, seems to yield a defective unification rather 
than a whole which is superior to the sum of its parts. One gets the impression that 
the optimistic attitude of the authors is more a fond hope than a realistic promise. 


Labor Commitment and Social Change in Developing Areas. Wilbert E. Moore and Ar- 
nold 8. Feldman, Editors. New York: Social Science Research Council, 1960. Pp. xv, 378. 
$3.75. 


EvGene Sratey, Stanford Research Institute 


| gum on papers prepared for a conference sponsored by the Committee on Eco- 
nomic Growth of the Social Science Research Council, this volume opens with 
four chapters of theoretical framework by the editors, Moore and Feldman, and con- 
cludes with a postscript on “Moot Points in the Theory,” also by the editors. In be- 
tween are fifteen chapters by individual contributors, grouped under “The Organiza- 
tion of Work,” “The Market Matrix,” “Competing Status Systems,” and “Changing 
Social Structures.” The unifying topic is industrial labor commitment, that is, per- 
formance and acceptance of the behaviors appropriate to an industrial way of life. 

There are very few numbers in the book, and they are not subjected to any novel 
statistical procedures. So a review in the Journal might stop here, except that the 
book raised acutely in the reviewer’s mind an issue of basic importance to statisticians 
concerned with social affairs, namely, the relation between theory and empirical evi- 
dence in the development of social science. 
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On this score the impact of the volume is perturbing, despite the fact that it pro- 
vides some suggestive insights and useful bits of evidence. The world certainly needs 
more light on the processes of social change in developing areas, and the ‘ask of our 
sociologists and anthropologists in pushing out the frontiers of scientific knowledge in 
this respect is urgent. The development of a science consists of interplay between 
theoretical structures and empirical observations, and in the planning of the sym- 
posium behind this volume there was a laudable effort to encourage such interplay. 
But somehow it did not come off very successfully. The links between the theoretical 
chapters and the chapters that offer factual evidence seem tenuous and haphazard. 
The theoretical framework chapters annoyed this reader by a sort of inbreeding of 
speculations, focused on citing ideas in the literature but almost devoid of concern for 
considering how they might be tested. The issues noted as controversial in the liter- 
ature are not sharpened into the form: “Now, if this contention is right, we should 
find such-and-such empirical evidence. How about it?” And the more empirically- 
oriented chapters usually go off pretty much on their own; sometimes they offer 
evidence that fits into the theoretical framework, but generally they discuss issues 
somewhat differently pitched. 

To check on this impression, I traced the treatment of one question—that of the 
role of “rewards for work” in labor commitment. Moore and Feldman in their the- 
oretical chapters discuss it briefly, noting different interpretations in the literature on 
the probable importance of financial vs. other incentives in early stages of industrial- 
ization. Cyril 8. Belshaw in his paper alludes to interesting Melanesian experience 
on the weak appeal of earning more money where the variety of things that money 
can do is limited. Peter B. Hammond, in an account of failure to achieve commitment 
of Mossi laborers on a Niger Irrigation Project, has illuminating observations about 
the unpreparedness of the Mossi for money budgeting problems. Peter Gregory in a 
paper on the labor market in Puerto Rico points to a relation between higher wages, 
the incentives of management to improve management practices, and stability of the 
labor force. Bert F. Hoselitz in his commentary discusses imperfections in the labor 
market and persistent differentials in wages. All these observations are pertinent to 
one aspect or another of the problem, but they are scattered through chapters that 
deal with a great variety of other topics, and they are nowhere brought to focus. The 
concluding theoretical chapter, written after the conference, has nothing on wage 
incentives at all. 

Nevertheless, a symposium like this and the report on it do stimulate workers at 
the frontiers of knowledge. 


The Impact of Collective Bargaining on Management. Sumner H. Slichter, James J. Healy, 
and E. Robert Livernash. Washington: The Brookings Institution, 1960. Pp. xv, 982. $8.75. 


Mitton Derser, Institute of Labor and Industrial Relations, University of Illinois 


ee. of the distinction of the senior author, the late Professor Slichter of Har- 
vard University, and the importance of his earlier work, Union Policies and 
Industrial Management, published in 1941, this volume had been awaited with special 
interest by students of labor relations. In some respects the anticipation has been 
justified. The volume is a vast and valuable compendium of facts and ideas about 
present-day collective bargaining at the plant and company level in the United 
States. Apart from the broad economic consequences of collective bargaining, which 
the authors deliberately omitted, the 31 chapters cover on a topical basis (one or two 
chapters to a topic) most of the important issues in the union-management agreement 
—hiring, training and apprenticeship, seniority, work-sharing and layoffs, promo- 
tions, work scheduling, work assignments, subcontracting, make-work rules, techno- 
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logical change, pensions and other employee benefits, methods of wage payments, 
job classification and evaluation, discipline, pressure tactics, grievance settlement 
and arbitration, labor costs, union-management cooperation, management structure, 
and the process of negotiating labor agreements. If any student or practitioner desires 
a clear, well-written, and sophisticated introduction to any of these topics, he now 
has a ready reference source going considerably beyond the text-book level. 

To those who are deeply concerned with the advance of knowledge in this dynamic 
and complex field, however, the volume may be disappointing on at least two counts. 
First is the authors’ deliberate eschewal of theory and the minimal effort at generaliza- 
tion. It is quite amazing to read that “the authors do not believe that the time is ripe 
to push very far with theories of the determinants of industrial relations . . . ” (p. 
6). If, after several decades of research, the field is not ripe for theory-building and 
testing, when will it be? This reviewer has not seen a better answer than that pro- 
vided in 1958 by the authors’ own Harvard colleague, Professor John Dunlop, who 
wrote in the preface to his Jndustrial Relations Systems (p. vi): 

While there is no disposition here to deprecate the gathering of facts of industrial 
relations, and many additional detailed studies are imperative, the field of industrial 
relations today may be described in the words of Julian Huxley: “Mountains of 
facts have been piled up on the plains of human ignorance. . . . The result is a glut 
of raw material. Great piles of facts are lying around unutilized, or utilized only in 
an occasional and partial manner.” Facts have outrun ideas. Integrating theory has 


lagged far behind expanding experience. The many worlds of industrial relations have 
been changing more rapidly than the ideas to interpret, to explain, and to relate them. 


As to the effort at generalization, each of the chapters does devote some attention 
to trends and patterns suggested by the data collected, but the over-all integrated 
analysis is limited to a concluding chapter of only 15 pages in which the authors com- 
ment on some “emerging characteristics” of the last twenty years. As at the outset, 
the dominant emphasis is on the diversity in collective bargaining and the extraor- 
dinary difficulty in generalizing. But the answer to the difficulty surely does not lie 
in sidestepping it. 

The second main source of disappointment is the authors’ failure to utilize statisti- 
cal thinking and techniques in a project involving nearly three years of field inter- 
viewing with about 650 individuals in 150 companies, 25 industry associations, and 
40 unions. Not a single significant table based on the field data is included in the 
nearly 1000 pages. Interview schedules are conspicuously absent. There is no explana- 
tion of the data collection methods. The reason given for the lack of statistical treat- 
ment is “that there appears to be no feasible way of getting a representative sample” 
(of industrial relations practice in American industry). “Material that is significant, 
rather than representative, has been sought” (p. 5). The criticism to be offered, of 
course, is not at the failure to analyze statistically data collected without a statistical 
design but rather at the failure to consider statistical possibilities in this type of 
project. 


Factory Wage Structures and National Agreements. D. J. Robertson. (University of 
Glasgow Social and Economic Studies V) New York: Cambridge University Press, 1960. 
Pp. xi, 260. $7.50. 


Mitton Derser, Institute of Labor and Industrial Relations, University of Illinois 


HIs volume got its direction when the author, in conducting a case study of a 
factory wage structure, discovered that the wages being paid were radically dif- 
ferent from the wage rates negotiated for the industry under the system of national 
collective bargaining which prevails for most industries in Britain. This finding was 
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not novel but it impelled the author to seek a general re-examination and restatement 
of the principles of wage settlement in Britain. Because of the complexity of British 
industry, he decided to focus his study on the so-called “engineering” industry, that 
vast, ambiguous conglomeration of enterprises manufacturing innumerable metal 
products as varied as turbines and wrist watch movements and employing about one- 
third of all manufacturing workers in Britain, and on the closely allied but much 
smaller industry of shipbuilding. The plan was to analyze the wage structures and 
wage determination policies of a small number of establishments and to compare 
them with the policies underlying the national agreements negotiated, respectively, 
by the Engineering and Allied Employers’ National Federation (now the Engineering 
Employers’ Federation) and the Shipbuilding Employers’ Federation with the Con- 
federation of Shipbuilding and Engineering Unions. Mr. Robertson had no illusions 
that his sample of two engineering factories and one shipyard was representative of 
the thousands of establishments throughout Britain, but he believed that even a few 
field studies at this level would illuminate the wage determination process in a man- 
ner which could not be achieved by industry or national wage studies. 

The conclusions drawn from the case studies justify this belief. The author presents 
convincing evidence not only that the wages at the factory level do not conform to the 
national agreements (a fact which Kenneth Knowies of the University of Oxford 
Institute of Statistics and others, including Mr. Robertson, had inferred from de- 
tailed industry statistics) but also that payment-by-results (incentive systems) and 
premium overtime payments have been deliberately employed (and often distorted 
from their proper functions) by managers and workers to give the factory wage struc- 
ture a relation to economic reality which the national negotiations lack. Mr. Robert- 
son therefore recommends that national negotiations be confined to determining 
minimum wages and general work conditions, such as holiday pay, but that the main 
job of wage determination should be conducted at the factory level so that wage deci- 
sions could be made in accordance with product and labor market realities and cur- 
rent misuses of incentive systems and overtime pay could be eliminated. Thus the 
author allies himself with a number of British labor economists, including Professor 
Phelps Brown of the London School of Economics, who have been urging a radical re- 
ordering of their country’s collective bargaining institutions. 

This final conclusion may be of interest to American students of industrial rela- 
tions in view of the trend toward industry-wide bargaining in a number of our major 
industries, although its relevance to the American scene must be qualified by the fact 
that none of our national negotiations relates to such a wide range of products as is 
found in British engineering and by the recognition that national agreements prop- 
erly should allow for supplemental local wage negotiations. 

In general, however, the volume will be of value mainly to wage specialists and to 
students of the British economy rather than to the majority of our labor economists 
because the British system of wage determination (at both the collective bargaining 
and legislative leveis) differs from the American in so many important respects. The 
specialist may find much of the material in the chapters dealing with the principles of 
payments-by-result, overtime, and rewards for skill rather elementary but the concise 
economic rationale for wage differentials in opposition to Barbara Wooton’s case for 
equality is unlikely to be found in American writings, and interspersed among the 
familiar are some incisive comments which will stimulate even the well informed. 
Concluding chapters on wage policy (printed earlier as an article in slightly different 
form) and on wage theory do not add much to the bundle of concepts already avail- 
able in this country. 

The statistical portions of the study are presented simply and clearly. There was 
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no need for highly sophisticated statistical technique and none was introduced. The 
discussion of the data indicated that the author knew what he was about and there is 
no reason to question either their reliability or validity. In recommending improve- 
ments in national policies to control wage inflation, Mr. Robertson includes, without 
specifications, a call for an expansion of government-collected wage statistics to pro- 
vide more detailed information for collective bargaining than the average movements 
of earnings by industry groups. The industry and labor market occupational wage 
surveys conducted by the U. 8S. Bureau of Labor Statistics would appear to be the 
sort of model he has in mind. 


The Research Revolution. Leonard S. Silk. New York: McGraw-Hill Book Company, 
Inc., 1960. Pp. x, 244. $4.95. 


WituraM M. Capron, Stanford University 


HE author accepts the thesis that we have entered an era of “systemized innova- 

tion” since World War II and that the high level of economic growth (4-5 per cent 
per year) we need if we are to win the race—economic and ideological—with the 
Communist world is made possible (though not guaranteed) by our ability to produce 
new ideas at a high rate. Silk covers a very wide range in the compass of this compact 
and highly readable book written in non-technical terms for the layman. Topics 
mentioned include the following: the statistical and analytic difficulties in measuring 
economic growth; a case study of a major post-War innovation (the transistor) and 
the impact of this innovation on the electronics industry; a balanced (though super- 
ficial) review of the “balanced growth” theories of Domar and Harrod, plus a catalog- 
ing of some of the chief alternative views on the stability/growth problem; a review 
of the character and magnitude of R&D in the past 15 years; the growing importance 
of trained manpower and the shortage, particularly of scientists and engineers, 
created by the rapid growth of R&D. The book is introduced by Professor Leontief 
with a brief discussion of certain of the special economic problems involved in inven- 
tion and research. An appendix reproduces the paper by Silk’s McGraw-Hill col- 
leagues, Keezer, Greenwald and Ulin, on the outlook for R&D expenditures in the 
decade ahead. These authors and Silk accept uncritically the available statistics on 
R&D spending and from these figures draw their inferences regarding the rate of 
growth of R&D. While this activity has almost certainly expanded in rea] terms it is 
also almost certain that some fraction of the growth shown by the uncorrected figures 
is spurious and is accounted for, first, by the greatly improved coverage of business 
firms engaged in this activity and second, by the overt recognition of already existing 
activities as “R&D” by many firms in the face of the growing popularity of this ac- 
tivity. While many readers will find this potpourri of topics interesting (to the extent 
they are not familiar with the original sources), I for one am not prepared to accept 
the notion that we have witnessed a “research revolution” which has profound im- 
plications for economic growth, on the basis of the theoretical and empirical informa- 
tion so far available. It seems to me that the evidence such as it is can as well be 
read as showing a gradual evolution from the days of the part-time back-yard in- 
ventor of the 19th century to the large scale industrial laboratory of today. Further- 
more, the increasing scale of R&D and the increasing fraction of our resources de- 
voted to this activity may not be producing economically significant new processes 
and products at a faster rate—the changing character and quantity of R&D may 
simply reflect the increasing effort required, as technology advances, to push it even 
further. 
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Economics of Watershed Planning. G. S. Tolley and F. E. Riggs, Editors. Ames, Iowa: 
Iowa State University Press, 1961. Pp. ix, 339. $3.95. 


J. W. Mituman, Indiana University 


pees volume is a smorgasbord of papers dealing with the economics of small water- 
shed planning. This luncheon will be attractive to persons interested in the 
operational aspects of the planning process. The diet is variegated and action-oriented; 
it will not appeal strongly to readers interested in integrated or specialized analysis. 

Since 1954, when Congress approved the Watershed Protection and Flood Preven- 
tion Act (Public Law 566), the planning and development of local watersheds has re- 
ceived a great deal of attention, particularly from the Soil Conservation Service and 
the U. S. Department of Agriculture. This emphasis on small drainage areas may be 
contrasted with the more comprehensive river basin approach favored by other 
federal agencies. Although these two approaches should be made compatible, it is 
clear that their implementation has led to overlapping and failure to take into ac- 
count obvious interdependencies. In addition, watershed planning should be inter- 
disciplinary and involve the integration of law, engineering, economics, and public 
administration. To examine these various problem areas, the Symposium on the 
Economics of Watershed Planning was held in Knoxville, Tennessee in June, 1959 
under the sponsorship of the Southeast Land Tenure Research Committee, the Farm 
Foundation, and the Tennessee Valley Authority. This book constitutes the pub- 
lication of the proceedings of this symposium; it contains twenty major papers, and 
prepared discussion on seventeen of them. All in all, over fifty persons are listed in the 
table of contents. 

The analysis in this book is uneven, choppy, and somewhat repetitious. Topics 
covered include many aspects of benefit-cost study, water law, a model watershed 
district act, cost-sharing arrangements, organization for watershed planning, case 
studies, potential investment possibilities, and the philosophy of watershed policy. 
Suggestions run the gamut from air-photo mapping to the use of dynamic program- 
ming. With few exceptions the papers show little awareness of new work in capital 
budgeting, formal criteria for investment and multi-period investment approaches. 
Many times basic assumptions as to discount rates and time streams of costs and 
benefits are unspecified. 

At least six papers will be of direct interest to statisticians. George 8. Tolley and 
Ralph A. Freund analyze the sensitivity of flood control benefits to a number of 
alternative economic and hydrologic assumptions in order to derive a probability 
distribution of benefits. I wish that this study could be expanded to include the re- 
sults of alternative interest rates and various time paths of benefits. George Pavelis 
contrasts conventional benefit-cost analysis with linear programming in a study of the 
Nepper watershed of western Iowa. Emery N. Castle does a nice job of illustrating 
the use of activity analysis in a prototype watershed system. Arthur J. Coutu uses 
synthetic land-use models in estimating income and hydrologic effects of alternative 
watershed programs. Earl D. Heady presents a programming model for specifying the 
optimum scale and allocation of investment between alternatives within a watershed. 
Unfortunately, he concludes that the model has little present hope of empirical 
verification. Robert Dorfman attempts to apply programming techniques to a fairly 
simple watershed problem. Even though stochastic elements in the situation are 
ignored, it is clear that simple cases of quasi-linear programming demand engineering 
and economic data not usually available to watershed planners. Dorfman concludes: 


“All that this exercise shows is that the programming approach has survived a simple 
test, like the Wright brothers’ flying machine that flew over a hundred feet” (p. 229). 
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Carroll Wright and Labor Reform: The Origin of Labor Statistics. James Leiby. Cam- 
bridge, Massachusetts: Harvard University Press, 1960. Pp. ix, 242. $4.75. 


Mark Peruman, The Johns Hopkins University 


His is a biography as well as a picture of the evolution of economic statistics col- 

lection in the United States between 1870 and 1905. It is both a story of an in- 
teresting self-educated man’s life and a description of the slow development of sys- 
tematic economic statistics compilation in this country, for Carroll D. Wright, the 
contemporary of Ernst Engel as well as Luigi Bodio and A. L. Bowley, had virtually 
no formal training, but was responsible for the founding and development of what 
is now the Bureau of Labor Statistics. By training he was a lawyer, a product of the 
tradition of reading law in another lawyer’s office; professionally, he was a public 
administrator. 

His introduction to the field of social statistics came as a result of the Republican 
Party’s need to salvage the Massachusetts Bureau of Labor Statistics. That Bureau, 
organized to appease worker sentiments, had “gotten out of hand,” and was exposing 
sore of the Party’s sacred cows. Wright, a lame-duck State senator, was asked to 
renovate the Bureau. Under his direction the Bureau collected considerable important 
social information, but it concentrated on non-controversial topics when it could, 
and de-emphasized its startling findings, when it couldn’t. 

The book is divided into seven chapters. The first two concern Wright’s back- 
ground, education, and political conditioning. They explain his rather loose reformist 
philosophy—it might be considered a mildly Christian Socialist view. 

In the third chapter the author describes the early history of labor statistics in 
America and explains the political configuration which led to the establishment of 
the state bureaus of labor statistics. In the fourth chapter, Professor Leiby considers 
Wright’s work in the Massachusetts Bureau. He notes that Wright was basically 
interested in general demographic data rather than labor statistics. He was not 
particularly interested in describing accurately the distribution of wealth, the variety 
of wages paid, or even such aspects of the population as degrees of skill. 

In the fifth chapter there is a good discussion of the crudeness of Wright’s statistical 
standards. Of course Wright knew nothing of sampling techniques. In it the author 
also discusses Wright’s work at the Bureau of Labor Statistics and his relationship 
with the Bureau of the Census. Indeed, Wright administered the Census of 1890 
and was instrumental in directing the Census of 1900. Neither Census was a poor 
job, although the American Economic Association was vehement in its condemnation 
of the accuracy of data collection in the former. The sixth chapter is devoted to 
Wright’s views on labor relations. 

Wright resigned from the Bureau of Labor Statistics (at that time in the Depart- 
ment of the Interior) in 1905. He was associated briefly with the Carnegie Institution 
of Washington and later became the President of Clark College in Worcester, Massa- 
chusetts. These experiences are analyzed in the seventh and last chapter. 

In retrospect it is easy to underestimate Wright’s achievement. Essentially he was 
an administrator; certainly he was no statistician. The author makes it clear, how- 
ever, that the gathering of social statistics is a function of social development and 
political education as well as the refinement of a discipline. It is unfair to judge 
Wright’s achievements along the lines suggested by the careers of Irving Fisher, 
Engel, or even Emile Lavaseur, because each of these men worked in different milieu. 
Yet, if Wright had had a better technical grasp of his statistical opportunities, there 
is no doubt but what this country’s records would have been improved 30 years be- 
fore they were. In sum, he was far more a gentlemen than a scholar. The last he never 
really pretended to be. 
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The author is on the history faculty at Rutgers University. The book, originally 
submitted as a doctoral dissertation, appears as a polished job. 


Pure Mathematics. F. Gerrish. New York: Cambridge University Press, 1960. Volume I, 
Calculus. Pp. xix, 361. $5.00. Volume II, Algebra, Trigonometry, Coordinate Geometry. 
Pp. xxi, 363-758. $6.50. 


Paut Meyer, Washington State University 


= first thought that crossed my mind on seeing these two volumes, was that an 
error had been made in indicating the subject matter of the two volumes. One is 
simply not used to the idea of introducing the Calculus first, followed by concepts of 
Algebra, Trigonometry and finally Coordinate Geometry. However this is precisely 
the order in which the author takes up these subjects. Although it is claimed that 
Volume II could be taken up independently of Volume I, this is not too practical 
since a large number of references to the Calculus are made in the second volume. In 
addition, Volume II contains all the material on Infinite Series, Thus, this reviewer 
feels that these two volumes could not be adopted very easily to any American cur- 
riculum covering say, the first two years of university Mathematics. However that 
was really not the author’s purpose. Rather, as expressed in the preface, “This two- 
volume text-book in Pure Mathematics has been designed to cover completely the 
requirements of the revised regulations for the B.Sc. General Degree (Part 1) of the 
University of London.” Presumably the preparation of the students for whom these 
books is intended is quite different from that of our students. 

Since the volumes seem not too suitable as texts, the question remains as to their 
use for reference books. I believe that for such a purpose, considerable consideration 
might be given. Volume I contains the traditional topics of the Calculus and its 
applications to areas, volumes, curvature, etc. In addition it contains a chapter on 
Differential Equations. 

The second volume covers the usual topics of polynomial algebra, determinants, 
systems of equations, finite and infinite series, complex numbers (developed quite 
thoroughly) and various topics from theory of equations, analytic geometry, threc- 
space geometry and finally spherical trigonometry. (The Trigonometry referred to in 
the title refers to the latter; knowledge of plane trigonometry is assumed.) 

Both volumes are written carefully and with considerable rigor. The various ideas 
which are introduced are generously illustrated with examples of a non-trivial nature. 
Careful proofs are given throughout. There are a large number of problems at the end 
of each section, many of which are not routine. Answers are provided in the back. 

Just a final comment on the title. It is to be hoped that these two volumes will 
never be confused with G. H. Hardy’s classic Pure Mathematics. In view of the 
priority of title and the generally high esteem and widespread use of Hardy’s volume, 
the choice of title here seems somewhat dubious. 


Quality Control. Fourth Edition. Norbert L. Enrick. New York: The Industrial Press, 
1960. Pp. v, 216. $5.50. 


W. Grant Ireson, Stanford University 


je title page of this edition of Mr. Enrick’s Quality Control bears the subtitle 
“A Manual of quality control procedure based upon scientific principles and 
simplified for practical application in various types of manufacturing plants.” This 
statement tells a large part of the story of this book. Mr. Enrick’s motives are laud- 
able, but he has fallen into the same trap that has engulfed so many others who have 
attempted to write a technical book for the layman. He underestimated the ability 
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of the readers to whom he addressed his book, and tried to write a book about statis- 
tical methods without including any statistics. He practically insults his readers by 
implying that they cannot do simple arithmetic or keep such simple matters as dif- 
ference in sample size (and the effects of those differences) straight in their heads. 
A few examples will serve to illuminate some of the more important shortcomings. 

Typical of the over-simplifications is the statement in Chapter 1, “Here modern 
statistics of mathematical probability helps the inspector by furnishing him with 
ready-made sampling plans that guarantee a minimum amount of inspection for a 
maximum amount of protection against sampling errors.” The italics are Mr. Enrick’s. 
No qualifying assumptions or conditions were stated. This statement was made in his 
discussion of the different inspection plans: Screening, Process Inspection, and Lot- 
by-lot Inspection. The reader is then referred to Chapter 2, “Procedure in Installing 
Lot-by-lot Inspection,” in which is given a “Master Sampling Table.” This is a set 
of sequential sampling plans for “Allowable Per cent Defective” from 0.25 to 12.0% 
and lot sizes from “499 or less” to “100,000 and up.” No attempt was made to define 
the “allowable per cent defective” in a quantitative way. 

Mr. Enrick seems to abhor the use of standard terminology and symbols. His 
“allowable per cent defective” is an example of how he uses terms that will not be 
encountered in common statistical quality control literature. 

The sequential plans are presented first, the author stated, because sequential 
sampling requires “on the average” only half as much inspection as single sampling. 
He did not qualify this statement in any way, although he did point out that for either 
very good or very bad lots, the sequential plans usually accept or reject on the first 
sample. He ignored the reality that more often than not the presented quality level 
is likely to be fairly close to the acceptable quality level, which is the condition that 
tends to cause several sequential samples to be inspected before a decision can be 
reached. 

In the same chapter the author introduced the X chart (without using the symbol) 
and showed both the specifications and the upper and lower control limits. He implied 
that when a chart shows control all the products will meet specifications, and con- 
versely that a process out of control will produce defective articles. This is based on 
his method of computing control limits, which he called “modified control limits.” 

He computed the upper and lower control chart limits for the X chart by Xmax—R 
and Xmin+R, but does not use either of these standard symbols. He calls R the 
“Average Range” at the time he calculates the value, but most frequently refers to it 
as the “process variability.” On p. 36 he said that this method of computing the 
control chart limits is applicable to samples of 3, 4, or 5, but that, “If different sample 
sizes are used, a numerical adjustment (as tabulated below) must be made in de- 
termining the value of the Process Variability before drawing control lines.” He then 
gave the table: 


“Sample Size Multiply Process Variability By” 
2 1.5 


3 
4 
5 
6 
8 


to 10 


The author omitted any discussion of the meaning of “statistical control” and 
ignored the fact that this “modified control” chart does not answer the question, 
“Ts the process in a state of control?” Similarly, he ignored the situation where the 
tolerance is smaller than the natural process variation. 
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The foregoing examples from the first few chapters indicate the level of under- 
standing that the author is trying to impart to his readers. In my opinion this is a 
disservice to the people who may be reading the book in hope of learning something 
about statistical quality control. It is quite obvious that the reader will have to 
“unlearn” many of these ideas before he can understand the value of the statistical 
tools available to him. 

The author waited until Chapter 15 to present the Shewhart Control Chart for 
Variables and introduced it with this paragraph: 

“A sisaple and useful control chart for hardness tests on shovel blades was pre- 
sented in Fig. 5, Chapter 4. As explained in Chapter 4, this chart was constructed by 
deducting the Process Variability (Average Range multiplied by adjutment factors) 
from the maximum specification limit and adding it to the minimum. The resultant 
values, plotted as two lines on the chart, constituted the control limits.” 

Then the authors computed the control limits for the X chart by using the familiar 
X+A.R, but he does not call it that. He gave a table of A» factors under the title 
of “Factors Used in Computing Control Limits.” He used F, instead of Ao. 

His reason for this chapter and this method is rather interesting: “The limits 
(those modified limits calculated from the specifications in Chapter 4) are suitable 
for most uses, especially in piece-parts applications. However, in many of the process- 
ing type of industries, the concept of a natural individual piece of product is often not 
applicable, and consequently there are no specification limits for individual items. 
For example, what yardage length of a large metal coil constitutes a “piece” or “unit 
of product?” How many gallons or pints or other units constitute an individual item 
in a tank car of industrial alcohol? 

“In such cases and in other situations where it is considered desirable, we must 
work from either the Process Average or nominal standard in constructing control 
limits. (It should be noted that the term “Process Average” is often replaced by the 
notation “Grand Average,” to emphasize that it is usually computed from a series of 
sample averages.) When drawn on a control chart, the Grand Average or nominal 
standard appears as the so-called “center line.” 

It would appear from the above quotations that the author is unaware that vari- 
ables control charts are used to control quality characteristics and are not dependent 
upon some “unit of product.” Is a tensile test specimen from a coil of steel wire a 
“unit of product” and is the nature of the sample to be used in evaluating some 
quality characteristic at all significant in deciding whether to set control limits from 
the specifications or from the “Grand Average?” I think not. 

The author included three brief chapters, “Analysis of Variance,” “Statistical 
Tolerancing” and “Optimizing Processing through Evolutionary Operation,” that 
are not ordinarily found in the manual type quality control book. In these chapters 
he presented rather complicated subjects in very simple terms, but with sufficient 
clarity that the ordinary reader will understand the objectives and be able to carry 
out simple analyses by following the instructions given. The author also suggested 
that the reader should recognize the applicability of these techniques to his problems 
and call in a statistician for professional guidance when he does not understand them. 

The organization of the book leaves much to be desired, and the book would have 
been immensely improved if the author had assumed that anyone sufficiently in- 
terested in quality control to read a book on the subject would be intelligent enough 
to use simple algebraic symbols and to recognize that different sample sizes might 
require different “adjusting” factors. In his attempt to write a book about statistical 
tools in non-statistical language, and to assure the reader that he need not understand 
the statistical methods used in developing the tools, the author has ruined what 
might otherwise have been a very useful book for the uninitiated inspector or even 
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for the manager who would like to understand the language spoken by his quality 
control director. 


Basic Statistics for Busir.ess Economics. C. Frank Smith and D. A. Leabo. Homewood, 
Illinois: Richard D. Irwin, Inc., 1960. Pp. xi, 301. $6.00. 


Epwin B. Cox, Boston University 


HIs book covers the topics traditionally included in the first course in statistics 
for students of business or economics—collection and presentation of data, the 
frequency distribution, averages and measures of dispersion, point and interval 
estimation of means and proportions, testing of hypotheses concerning means or 
proportions, simple correlation, time series analysis and index numbers. It is designed 
to be covered in one semester. In general the text is well written and amply supple- 
mented with tables and charts. Each chapter opens with definitions of the symbols 
used for the first time in the chapter and closes with a summary and bibliography 
(unfortunately, not annotated). The tables needed to work any problem in the text 
are included in the Appendix, together with a good review of selected mathematical 
topics including arithmetic, algebra, the use of summation notation and logarithms. 
In the selection and treatment of topics this book is highly conventional. A few 
topics which are treated are occasionally omitted from basic texts and may therefore 
be worth noting. These include a discussion of the use of normal probability paper 
(surprisingly referred to as “arithmetic” probability paper), the geometric and har- 
monic means, the mean and quartile deviations, the “second” or dual regression 
equation and a full algebraic development of the short-cut formulas for computing 
the mean and standard deviation. Otherwise, as the authors state in the Preface, 
“No attempt has been made to present material in a strange or novel manner.” 

The authors have devoted one-half of the text to the treatment of collection and 
presentation of data, the frequency distribution, averages and measures of dispersion. 
These topics are discussed in great detail. In contrast, the treatment of the more 
important and more difficult material on sampling theory, estimation, inference and 
correlation is hurried and appears deceivingly compact. In a one-semester course, 
which this book is designed to serve, the student needs help from his text particularly 
in these phases of the subject which are so new and mysterious to him. This book 
lets him down at just these points. 

A second shortcoming of this work lies in the few problems which appear at the 
end of each chapter. In most cases a set of data is presented, followed by the word 
“Required” and a list of values to be computed by the student. Problems not involv- 
ing calculations tend to be equally unimaginative and of little or no value as teaching 
devices. In the crowded marketplace where beginning statistics texts are offered, 
skillful product differentiation is one of the keys to success. The use of well written, 
imaginative problems is one of the most promising methods of achieving a really 
different product. In this area the authors of this text have let the student down again. 

There are several incorrect or misleading statements which may add further to the 
student’s difficulties. On page 4 one reads, “Defined, a random sample is one in which 
each item in the universe has an equal opportunity of being selected in the sample .. .. 
If some items are more likely to be selected than others, then the sample is biased.” 
These statements are true of simple random samples, but not of probability samples in 
general. On page 73 the reader will find the following, “Then too, the median cannot 
be computed as exactly as can the mean, and its reliability cannol be so readily esti- 
mated.” And on page 75, “In addition, it (the mode) cannot be treated algebraically 
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or tested for reliability.” These statements seem to substitute, incorrectly, the ease 
(or possibility) of calculating reliability for the degree of reliability of these measures. 
On page 104 we find the following puzzling sentence, “And of course skewness is a 
function of grouped data, chiefly.” Skewness may be easier to spot in grouped data, 
but the existence of skewness is a characteristic of the data and not the method 
by which they are presented. 

The student who seeks an answer to the question, “Is S[(2z?/N—1)"?] an un- 
biased estimate of the population standard deviation?” will find the answer “No” on 
page 95, but “Yes” on page 136. This seems an unnecessary question to interject 
into the discussion in any case, since this is a text for a one-semester course. To pro- 
vide the student with contradictory answers is certainly unfair to him. He will be in 
a similar situation if he wonders whether he can correctly apply probability state- 
ments to specific confidence intervals. On page 188 he is told he must not, but on 
pages 144 and 235 the authors do just that. 

Those who bear responsibility for teaching an introductory statistics course in 
three semester hours are in an unenviable position. The scope and importance of 
modern statistical methods applicable to business and economic analysis are so great 
that this allotment of time is insufficient to do justice to the material. In this cir- 
cumstance, the choice of a text for such a course must be made with great care. In 
this reviewer’s opinion, a good text is one which (1) treats the topics which are most 
important, (2) is particularly lucid and thorough in the coverage of those points 
which are generally difficult for the student, (3) contains generous helpings of imag- 
inative problem material, and (4) through allusions in the chapters and annotated 
bibliography assists the better students in extending their knowledge of statistics to 
topics not covered in class. This text by Smith and Leabo does not meet any of the 
latter three tests satisfactorily. 


Evaluation of Drug Therapy. Francis M. Forster, Editor. (Proceedings of the Symposium 
on Evaluation of Drug Therapy in Neurologic and Sensory Diseases held at the University 
of Wisconsin, ‘May 1960.) Madison, Wisconsin: University of Wisconsin Press, 1961. 
Pp. xxiv, 167. $4.00. 


TATISTICIANS will be primarily interested in the three papers on The Planning of a 

Clinical Trial by Seymour Jablon, W. G. Cochran and Shirley H. Ferebee, with 
opening and closing remarks by J. Yerushalmy. The volume also includes papers on 
the Pharmacological Aspects of Testing and a series of Panel Reports on possible 
tests for each of a number of different neurological and sensory disorders. The mem- 
bership of each panel included a statistician, all of whose names are duly listed (al- 
though not always properly spelled) at the front of the volume. Some of the panel 
reports carry internal evidence of the presence of a statistician. J.C. 


Yearbook of Labour Statistics, 1960. International Labour Office. Geneva: International 
Labour Office, 1961. Pages xxiii, 551. $5.00, paper, $6.00 cloth. 


5 pes twentieth issue of this publication follows the same format as the preceding 
ones. Presenting a summary of the principal statistics relating to labor in all parts 
of the world, annual series are given through 1959 and monthly and quarterly figures 
are brought up to June, 1960. Data are provided on employment and unemployment, 
wages and hours of work, consumer prices, family expenditure studies, social security 
and industrial injuries and disputes. Supplementary sections also provide recent 
figures on industrial production, wholesale prices, and exchange rates. R. F. 
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Statistical Tables for Tests of Multivariate Hypotheses. K. C. S. Pillai. Manila: The 
Statistical Center, University of the Philippines, 1960. Pp. viii, 46. No price listed. 


fo volume contains three tables that will be useful in many applications of multi- 
variate statistical analysis. 

Suppose that the characteristic roots of a determinental equation have the joint 
distribution published by Fisher, Hsu and Roy in 1939. Then, Table 1 presents the 
upper five per cent and one per cent points of the distribution of the largest charac- 
teristic root; Table 2 presents the upper five per cent and the upper one per cent 
points of the distribution of the sum of the characteristic roots, and Table 3 presents 
the upper five per cent and upper one per cent points of the sum of the ratios of each 
root to its complement. The range of number of non-zero roots to which all three 
tables apply is from two to eight inclusive. 

There is an introduction in which the uses of the tables are explained and illustrated. 

W.G.M. 


Numerical Methods for Science and Engineering. Ralph G. Stanton. Englewood Cliffs, 
N. J.: Prentice-Hall, Inc., 196i. Pp. xii, 266. $9.00 trade, $6.75 text. 


f bere book is designed as an undergraduate text for students who have a good 
manipulative knowledge of calculus and differential equations. The approach 
is such as to emphasize numerical rather than theoretical results. The development is 
from the standpoint of hand and desk-calculator techniques, and there are numerous 
exercises which form an essential part of the text. The chapter titles are: 1. Ordinary 
finite differences, 2. Divided differences, 3. Central differences, 4. Inverse interpola- 
tion and the solution of equations, 5. Computation with series and integrals, 6. 
Numerical solution of differential equations, 7. Linear systems and matrices, 8. Solu- 
tion of linear equations, 9. Difference equations, 10. Solution of differential equations 
by difference equation methods, 11. The principles of automatic computation. Of 
particular interest to the statistical clientele is a section on least squares approxima- 
tion, the chapters on linear systems which include a discussion of eigenvalue prob- 
lems and ill-conditioned equations, and a section dealing with Mill’s ratio. There are 
also several examples taken from the statistical papers. The abundance of examples 
and the lucid writing should make this a good text; it may alse serve as supplemen- 
tary reading in statistics courses when there is need for numerical methods. I. 0. 


A Bibliography of the Writings of Irving Fisher. Irving Norton Fisher. New Haven, 
Connecticut: Yale University Library, 1961. Pp. xii, 543. $10.00. 


biome aim of this work is to present a complete and definite bibliography of all the 
writings of Irving Fisher—no mean task in view of the man’s extraordinary pro- 
ductivity, prolific imagination, and breadth of interests. The task is greatly com- 
plicated by Irving Fisher’s life-long desire to reach the common man which led him 
to publish extensively in newspapers, popular magazines, and pamphlets. The titles 
are organized chronologically but are classified under seven main headings: (1) 
Economies, (2) General, (3) Health, (4) Monetary Theory, (5) Peace, (6) Syndicated 
by Fisher but written by others, (7) Comments on Fisher by others. The references 
are cross-referenced extensively. Most bibliographies make dull reading; but Fisher 
was such a fascinating personality that his does not. M.N. 


Atlas of Economic Development. Norton Ginsburg. Chicago, Illinois: University of Chicago 
Press, 1961. Pp. vii, 119. $7.50 cloth, $5.00 paper. 


em 48 maps in this atlas present the geographical distributions of many important, 
quantifiable factors affecting economic development and a general level of eco- 
nomic performance. A number of monetary indicators are mapped, including gross 
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national product and several items related to international trade. Among the non- 
monetary variables are yields of agricultural products, energy consumption, petroleum 
refining capacity, railway and road density, and the like. The usual demographic 
indicators will also be found: population density and growth rates, urban-rural dis- 
tribution, literacy, school enrollment, etc. Many new indicators for the level of 
economic development are also presented. Almost all the figures presented are for the 
last half of the 1950’s, and therefore do not describe the changes which have occurred. 
Nonetheless, the atlas should prove a useful supplement to other materials for those 
interested in problems of economic growth and change. M.N. 


Input-Output Analysis of Norwegian Industries, 1954. Per Sevaldson. Oslo, Norway: 
Central Bureau of Statistics, 1960. Pp. 614. Kr. 10,00. Paper. 


_ primary purpose of this work is to provide the data required for application of 
input-output analysis to the Norwegian economy. In addition, the volume con- 
tains an introduction to the theory of input-output and a survey of applications. 
There is a short English summary for the benefit of those who do not read Norwegian. 

M. N. 


Tabies of the Hypergeometric Probability Distribution. Gerald J. Lieberman and Donald 
B. Owen. Stanford, California: Stanford University Press, 1961. $15.00. 


HESE extensive tabulations of point and cumulative probabilities (both to six 
decimal places) of the hypergeometric distribution appear to have been compiled 
with great care and should facilitate more extensive use of the distribution in the 
physical and social sciences and in engineering. A 26-page introduction explains the 
distribution, the construction of the tables (including checks on the accuracy of the 


calculations), some applications, approximations, methods of interpolation, and 
formulae for sums of combinatorials. Probabilities corresponding to all samples from 
finite populations of sizes 1(1)49 and 50(10)100 may be obtained from the tables 
as well as probabilities for selected samples from populations of sizes 100(100)2000. 
There is also a table of log N/ for N =1(1)2000. C. ii, 
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Brown, Kenneth E. and Obourn, Ells- 
worth S. (United States Department of 
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ciates, 1961. $10.00. 

Collier, Raymond O., Jr. and Elam, Stan- 
ley M., Editors. Research Design and 
Analysis. (Second Annual Phi Delta 
Kappa Symposium on Educational Re- 
search) Bloomington, Indiana: Phi Delta 
Kappa, Inc., 1961. $3.50. 

Comune di Roma. Roma—Popolazione E 
Territorio Dal 1860 al 1960. Rome, Italy: 
Comune di Roma, Ufficio di Statistica e 
Censimento, 1960. £5,000. Paper. 

Cox, Richard T. The Algebra of Probable 
Inference. Baltimore, Maryland: The 
Johns Hopkins Press, 1961. $5.00. 

Crow, Edwin L., Davis, Frances A., and 
Maxfield, Margaret W. Statistics Man- 
ual. New York: Dover Publications Inc., 
1960. $1.55. Paper. 

Dehn, Edgar. Algebraic Equations. New 
York: Dover Publications Inc., 1960. 
$1.45. Paper (Reprint: First edition Co- 
lumbia University Press, 1930.) 

De Jonge, H., Editor. Quantitative Methods 
in Pharmacology: Proceedings of a Sym- 
posium Held in Leyden on May 10-18, 
1960. New York: Interscience Publishers, 
Inec., 1961. $13.25. 

Diem, Konrad, Editor. Documenta Geigy 
Wissenschaftliche Tabellen, Sixth Edition. 
Basel, Switzerland: J. R. Geigy, A. G., 
1960. No price listed. 

Donaldson, Elvin F. and Pfahl, John K. 
Personal Finance, Third Edition and In- 
structor’s Manual. New York: The Ron- 
ald Press Company, 1961. $7.50. 

Drown, Frank S. and Monahan, Thomas P., 
Editors. Forty-Sizth Annual Report of the 
Municipal Court of Philadelphia. Phila- 
delphia, Pennsylvania: Municipal Court 
of Philadelphia, 1959. No price listed, 


paper. 
Duncan, Otis Dudley, Cuzzort, Ray P., and 
Duncan, Beverly. Statistical Geography: 
Problems in Analyzing Areal Data. Glen- 
coe, Illinois: The Free Press, 1961. $6.00. 
East African Statistical Department. Fast 
Africa Insurance Statistics, 1950-1968. 


Shs, 3/-. Uganda Census 1959: Non- 
African Population, 1960, 10 Sh. Survey 
of Industrial Production 1958. 5 Sh. 1960 
Statistical Abstract. 10 Sh. Paper. Nai- 
robi, Africa: East African Statistical De- 
partment, 1961. 

Fano, Robert M. Transmission of Informa- 
tion: A Statistical Theory of Communica- 
tions. New York: John Wiley and Sons, 
Inc. and the Technology Press of Massa- 
— Institute of Technology, 1961. 

‘. ° 

Fisher, Irving Norton. A Bibliography of 
the Writings of Irving Fisher. New 
Haven, Connecticut: Yale University 
Library, 1961. $10.00. 

Forster, Francis M., Editor. Evaluation of 
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PUBLICATIONS RECEIVED 


Drug Therapy. Madison: The Univer- 
sity of Waesania Press, 1961. $4.00. 

Forsyth, A. R. Calculus of Variations. New 
York: Dover Publications Ine., 1960. 
$2.95. Paper (Reprint: First edition by 
Cambridge University Press) 

Francis, Roy G. The Rhetoric of Science: A 
Methodological Discussion of the Two-by- 
Two Table. Minneapolis: University of 
Minnesota Press, 1961. $4.75. 

Friend, Irwin and Jones, Robert, Editors. 
Proceedings of the —¢— on Con- 
sumption and Saving, Volumes 1 and 2. 
(Study of Consumer Expenditures, In- 
comes, and Savings) Philadelphia, Penn- 
sylvania: Wharton School of Finance 
Commerce, University of Pennsylvania, 
1960. No price listed. 

Fulks, Watson. Advanced Calculus: An In- 
troduction to Analysis. New York: John 
Wiley and Sons, Inc., 1961. $11.25. 

Gelfond, A. O. Transcendental and Alge- 
braic Numbers. New York: Dover Pub- 
lications Ine., 1960. $1.75. (Reprint: 
Translated from the first Russian edition) 

Ginsburg, Norton. Atlas of Economic De- 
velopment. Chicago, Illinois: University 
Press, 1961. $7.50 cloth, $5.00 paper. 

Graybill, Franklin A. An Introduction to 
Linear Statistical Models, Volume 
New York: McGraw-Hill Book Com- 
pany, 1961. $12.50. 

Guest, P. G. Numerical Methods of Curve 
Fitting. New York: Cambridge Univer- 
sity Press, 1961. $15.00. 

Hancock, Harris. Theory of Maxima and 
Minima. New York: Dover Publications 
Inc., 1960. $1.50. Paper. (Reprint: First 
edition by Ginn and Company, 1917) 

Hanson, Kermit O. and Brabb, George J. 
Managerial Statistics, Second Edition. 
Englewood Cliffs, New Jersey: Prentice- 
Hall, Inc., 1961. $9.25 Trade, $6.95 Text. 

Heady, Earl O. and Dillon, John L. Agri- 
cultural Production Functions. Ames, 
Iowa: Iowa State University Press, 1961. 
$6.95. 

Herring, "Charles J Mosely, Philip E., and 
Hitch, Charles J. HKesearch fo r Public 
Policy. (The book consists of lectures by 
three people plus comments by nine pan- 
elists. PW Jashington, D. C.: Brookings In- 
stitution, 1961. $2.50 C loth, $1.50 Paper. 

Hickman, Bert G. Growth and Stability of 
the Postwar Economy. Washington, D. C.: 
Brookings Institution, 1960. $6.00. 

Hitotsubashi University. Annotated Eco- 
nomic Statistics of Japan for Postwar 
Years up to 1958. Tokyo, Japan: Insti- 
tute of Economic Research, Hitotsubashi 
University, 1961. No price listed. 

Holt, Charles C., Modigliani, Franco, 
Muth, John F., and Simon, Herbert A. 
Planning Production, Inventories, and 
Work Force. Englewood Cliffs, New Jer- 
sey: Prentice-Hall, Inc., 1960. $10.00 
Trade, $7.50 Text. 

Horn, W. Reliability Survey: A Survey on the 
Reliability of Response to an Interview 
Survey. The Netherlands, 1961: Reprint 
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from Het PT T-bedrijf, Volume X, No. 3, 
October, 1960. $.75. Paper. 
International Labour Office. 1960 Year- 
book of Labour Statistics. Geneva, Swit- 
zerland: International Labour Office, 
1961. $6.00 Cloth, $5.00 Pa 
Jaffe, A. J. People, Jobs, and 
velopment. 


Jconomic De- 

Glencoe, Illinois: The Free 
Press, 1959. $6.00. 

James, Preston E., Editor. New View- 
= in Geography. Washington, D. C.: 

National Council for the Social Studies, 

1959. $5.00 Cloth, $4.00 Paper. 

Karreman, Herman F. Methods for Im- 
proving World Transportation Accounts, 
Applied to 1950-1953. Technical Paper 
15. New York: National Bureau of Eco- 
nomic Research, 1961. $1.50. Paper. 

Kemperman, J. H. B. The Passage Prob- 
lem for a Stationary Markov Chain. Chi- 
cago, Illinois: The Univerity Press, 1961. 


5.00. 

Klein, L. R. and Ball, R. J., Hazlewood, A., 
and Vandome, P. An Econometric Model 
of the United Kingdom. Oxford, Great 
Britain: Basil Blackwell, 1961. 60/-net. 

Kozelka, Robert M. Elements of Statistical 
Inference. Reading, Massachusetts: Addi- 
son-Wesley Publishing Company, Inc., 
1961. $5.00. 

Langer, Rudolph E., Editor. Partial mae 
ential a and Continuum, Mechan- 
ics. Publication Number 5 of the Mathe- 
matics Research Center, United States 
Army. Madison: University of Wisconsin 
Press, 1961. $5.00. 

Lieberman, Gerald J. and Owen, Donald B. 
Tables of the Hypergeometric Probability 
Distribution. Stanford, California: Uni- 
versity Press, 1961. $15.00 

Loveday, R. A Sevond Course in Statistics. 
New York: Cambridge University Press, 
1961. $1.85. 

Luce, R. Duncan, Editor. Developments in 
Mathematical Psychology. Chicago, Illi- 
nois: The Free Press, 1961. $7.50. 

MacNiece, E. H. Production Forecasting, 
Planning, and Control, Third Edition. 
New York: John Wiley and Sons, Inc., 
1961. $9.75. 

Mintz, Ilse. American Exports During 
Business Cycles, 1879-1958. Occasional 
Paper 76. New York: National Bureau of 
Economic Research, 1961. $1.00. Paper. 

Moore, Geoffrey H., Editor. Business Cycle 
Indicators, Two Volumes. Volume I: 
Contributions to the Analysis of Current 
Business Conditions. Volume II: Basic 
Data on Cyclical Indicators. Princeton, 
New Jersey: University Press, 1961. Vol- 
ume I, $12.50. Volume II, $4.50. Set, 
$15.00. 

Morgenstern, Oskar. The Question of Na- 
tional Defense, Second Revised Edition. 
New York: Vintage Books, 1961. $1.25. 
Paper. 

National Bureau F a Economic Research, 
Inc. Output, a and Productivity 

Measurement. Volume 25 in Studies in 
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Income and Wealth. Princeton, New 
Jersey: University Press, 1961. $10.00. 
National Bureau of Economic Research, 
Inc. Public Finances: Needs, Sources, and 
Utilization. Number 12 of — Con- 
ference Series. Princeton, New Jersey: 
University Press, 1961. $10.00. 4 

National Science Foundation. Current Proj- 
ecls on Economic and Social Implications 
of Scientific Research and_ Development, 
1960. Washington, D. C.: United States 
Government Printing Office, 1961. $.40. 
Paper. 

New York Department of Labor. Annual 
Report of the State Advisory Council on 
Employment and Unemployment Secur- 
ity. New York: a gctanag of Labor, 
1961. No price listed, paper. 

Oliver, F. R. The Control of Hire-Purchase. 
London, England: George Allen and 
Unwin Ltd., 1961. 25/-. 

Orleans, Leo A. Professional Manpower and 
Education in Communist China. Wash- 
ington, D. C.: National Science Founda- 
tion, 1961. $2.00. sae by Superin- 
tendent of Documents. 

Panton, Arthur William. The Theory of 
Equations, Two Volumes. New York: 
Dover Publications Inc., 1960. $1.85 
each. Paper. (Reprints: Volume I: Long- 
mans, Green and Company, 1912, Vol- 
ume II: Longmans, Green and Company, 
1928. 

Payne, , Wilson F. Industrial Demands 

pon the Money Market, 1919-67: A 

Study in Fund-Flow Analysis. Technical 

er 14. New York: National Bureau 

_ nels Research, Inc., 1961. $1.50. 


Pa 
: 
aper. 
Petecaen: W. Wesley. Error-Correcting 
Codes. New York: John Wiley and Sons, 
Inc. and the Technology Press of Massa- 
chusetts Institute of Technology, 1961. 


$7.75. 
Poole, E. G. C. Introduction to the Theory of 


Linear Differential Equations. New 
York: pow Publications Inc., 1960. 
$1.65. Paper. (Reprint: First edition by 
Oxford University Press, 1936.) é 

Qureshi, Anwar Iqbal. Develo ments in 
Pakistan Economy Since the Revolution. 
Karachi, Pakistan: Nabeel Publishing 
House, 1961. Rs. 5.00 Nett. , 

Rees, Albert. Real Wages in Manufacturing 
1890-1914. Number 70 in General Series 
of National Bureau of Economic Re- 
search. Princeton, New Jersey: Univer- 
sity Press, 1961. $3.75. ‘ 

Sagan, Hans. Boundary and Eigenvalue 

roblems in Mathematical Physics. New 
York: John Wiley and Sons, Inc., 1961. 
$9.50. 

Salant, Walter S. and Vaccara, Beatrice N. 
Import Liberalization and Employment. 
Washington, D. C.: Brookings Institu- 
tion, 1961. $6.75. 

Salzer, Herbert E. and Kimbro, Gene- 
vieve M. Tables for Bivariate Osculatory 
Interpolation Over a Cartesian Grid. San 
Diego, California: Convair-Astronau- 
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tics, General Dynamics Corporation, 
1960. No price listed, paper. 

Salzer, Herbert E. and Roberson, Peggy T. 
Tables of Coefficients for Obtaining the 
Second Derivative Without Differences. 
San Diego, California: Convair-Astro- 
nautics, General Dynamics Corporation, 
1960. No price mused, gute. 

Salzer, Herbert E., Shoultz, Dexter C., and 
Thompson, Elizabeth P. Tables of Oscu- 

Integration Coefficients. San Diego, 
California: Convair-Astronautics, Gen- 
eral Dynamics Corporation, 1960. No 
rice listed, paper. 

Schlaiefer, Robert. Introduction to Statis- 
tics for Business Decisions. New York: 
McGraw-Hill Book Company, Inc., 
1961. $7.75. 

Schwartz, D., Flamant, R., Lellouch, J., and 
Rouquette, C. Les Essais Theréapeu- 
tiques a Paris, France: Masson 
and Cie, Editeurs, 1960. No price listed, 


paper. 

Smith, Dan Throop. Federal Tax Reform: 
The Issues and a Program. New York: 
McGraw-Hill Book Company, Inc., 
1961. $7.00. 

Social Science Research Council. Annual 
Report 1959-1960. New York: Social Sci- 
ence Research Council, 1961. No price 
listed, paper. 

Somers, Herman Miles and Somers, Anne 
Ramsey. Doctors, Patients, and Health 
Insurance: The Organization and Financ- 
ing of Medical Care. Washington, D. C.: 
Brookings Institution, 1961. $7.50. 

Statistisches Bundesamt. Stichproben in der 
Amilichen Statistik. Mainz, Germany: 
W. Kohlhammer Verlag, 1960. No price 
listed. 

Tew, Brian. The International Monetary 
Fund: Its Present Role and Future Pros- 
pects. Number 36 in Essays in Interna- 
tional Finance. Princeton, New Jersey: 
International Finance Section, Depart- 
ment of Economics, Princeton Univer- 
sity, 1961. $.25. Paper. 

Theil, H. Contributions to Economic Analy- 
sis: Economic Forecasts and Policy, Sec- 
ond Revised Edition. Amsterdam, The 
Netherlands: North-Holland Publishing 
pe 1961. $11.25. 

Tolliver, Wayne E. (Under direction of 
Justin C. Lewis) Harned Degrees Con- 
ferred 1958-1959: Bachelor’s and Higher 
Degrees. Washington, D. C.: United 
States Department of Health, Education, 
and Welfare, 1961. $1.50. Paper. 

Twin Cities Metropolitan Planning Com- 
mission. Metropolitan Population Study, 
Part II: Numbers and Distinction, and 
Street and Highway Standards. St. Paul, 
Minnesota: Twin Cities Metropolitan 
Planning Commission, 1961. No price 
listed, 8 ae 

United Nations Education, Scientific and 
Cultural Organization, Department of 
Social Sciences. Current School Enrol- 
ment Statistics. Paris, France: UNESCO, 
1960. $1.00. Paper. 





PUBLICATIONS RECEIVED 


United States Department of Commerce, 
Business and Defense Services Admin- 
istration. 

Inventories in the Textile Cycle. $.45. 
Paper. 

Leisure and Work Clothing: Trends and 
Outlook. $.25. Paper. 

Washington, D. C.: United States Gov- 
ernment Printing Office, 1961. 

United States Department of Health, Edu- 
cation, and Welfare, Office of Education. 
Current Expenditures Per Pupil in Public 

School Systems, Urban School Sys- 
tems, 1958-1959. $.50. Paper. 

Enrollmeni, Teachers, and Schoolhousing. 
$.25. Paper. 

Opening (Fall) Enrollment in Higher 
Education, 1960: Institutional Data. 
$.30. Paper. 

Washington, D. C.: United States Gov- 
ernment Printing Office, 1961. 

United States Department of Health, Edu- 
cation, and Welfare, Public Health Serv- 
ice. 

Geographic Divisions and Large Metro- 
politan Areas, United States July 
1957-June 1969. 3.35. Paper. 

Health Statistics: Hospital Utilization in 
the Last Year of Life. $.30. Paper. 

Health Statistics: Selected Health Char- 
acteristics by Area. $.35. Paper. 

Washington, D. C.: United States Gov- 
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ernment Printing Office, 1961. 
United States Joint Economic Committee of 
Congress. 
Part If. 


Government Price Statistics, 
$1.50. Paper. 

January 1961 Economic Report of the 
President and the Economic Situation 
and Outlook. No price listed, paper. 

January 1961 Economic Report of the 
President with Minority and Other 
Views. $.35. Paper. 

Washington, D. C.: United States Gov- 
ernment Printing Office, 1961. 

Weitzman, Murray S. and Elias, Andrew. 
(United States Department of Com- 
merce, Bureau of the Census) The Mag- 
nitude and Disiribution of Civilian Em- 
ne ne in the U.S.S.R.: 1928-1959. 

ashington, D. C.: United States Gov- 
ernment Printing Office, 1961. No price 
listed, paper. 

Wozencraft, John M. and Reiffen, Barney. 
Sequential Decoding. New York: John 
Wiley and Sons, Inc., and the Technology 
Press of Massachusetts Institute of Tech- 
nology, 1961. $3.75. 

Zarkovich, Slobodan S. (Food and Agricul- 
culture Organization of the United Na- 
tions) Sampling Methods and Censuses: 
Volume I. Collecting Data and Tabula- 
tion. Rome, Italy: United Nations, 1961. 


No price listed, paper. 
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Pup J. McCarruy, Cornell University 

Rospert C. McCarry, Stanford Research 
Institute 

Paut Meter, University of Chicago 

WILLIAM MENDENRBALL, Bucknell Diahiselte 

Rupert G. Miuusr, Jr., Stanford Univer- 
versity 

Feux E. Moore, University of Michigan 

James N. Moraan, University of Michigan 

Stezitt Moricuti, University of Tokyo and 
Columbia University 

DonaLp F. Morrison, National Institutes 
of Health 

NatHAN Morrison, New York State De- 
partment of Labor 

Lincotn E. Moses, Stanford University 

J. Neyman,. University of California, 
Berkele 

HarRo.p Mrssaxeow, United States Bureau 
of the Census 

Artuur M. Oxun, Yale University 

Guy H. Orcutt, University of Wisconsin 

Louis J. Parapiso, United States Depart- 
ment of Commerce 

EMANUEL ParzEN, Stanford University 

Wituram S. Perers, Arizona State Uni- 
versily 

FRANK Proscuan, Boeing Scientific Re- 
search Laboratories 

Ronatp Pyke, University of Washington 

Roy Rapner, University of California, 
Berkeley 

Morton 8. Rarr, Bureau of Labor Sta- 
tistics 

Paut R. Riper, Wright-Patterson Air Force 
Base 

Joan Raup RosEensBuatt, National Bureau 
of Standards 

HerMAN Rusin, Michigan State University 

JEROME Sacks, Cornell University 

I. R. Savace, University of Minnesota 

L. J. SavaGce, University of Michigan 

EstHer Serven, Michigan State University 

ALBERT SHAPERO, Stanford Research Insti- 


tute 

WiuiaM C, SHeiton, United States Depart- 
ment of Labor 

SipNeEy Siece., Pennsylvania State Univer- 
sily 

Watt R. Simmons, United States National 
Health Survey 

Monrosk G. SirkEn, National Office of Vital 
Statistics 

Mitton SospeEt, University of Minnesota 

WiuuraM A. Spurr, Stanford University 

CHARLEs STEIN, ag University 

ANDREW STERRETT?T, Denison University 

GeorGE J. SticLer, University of Chicago 

Mervyn Srone, Princeton University 

DanizEt B, Suits, University of Michigan 

Rosert Summers, University of Pennsyl- 
vania 

Cuarues E. Swanson, The Curtis Publish- 
ing Company 

Lorie Tarsuis, Stanford University 

Rosert F. Tare, University of Washington 

BENJAMIN J. TeppinG, National Analysts, 


ne. 

Joun TuKeEy, Princeton University 

AnpDREW Vazsonyi, Ramo-Wooldridge 

D. F. Votraw, Jr., Yale University 

Davip L. Wauuace, University of Chicago 

Joun E. Watsn, System Development Cor- 
poration 

ALFRED G. Wuitney, Life Insurance Agency 
Management Association 

Ricuarp C, Witcock, University of Illinois 

8S. S. Wiis, Princeton University 

Hans Wourr, Michigan State University 

Lorin Woop, United States Bureau of the 
Census 

THEODORE D. Wootsery, United States Na- 
tional Health Survey 

Tuomas A. YANcEyY, University of Illinois 

N. DonaLp YLvi1SAKER, New York Univer- 
sity 

MarvVIN ZELEN, National Bureau of Stand- 
ards 
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PROBLEM-SOLVING SIMPLIFIED 


Here is a quality-in-depth data-processing 
and computer service that simplifies 
problem-solving for researchers and 
Statisticians. It gives you the benefit of the 
long experience of the oldest and largest 
independent data-processing service bureau. 
It applies the most advanced methods and the 
latest electronic computer and data- 
processing facilities. It combines the skills of 
statisticians, mathematicians, methods 
engineers and programming specialists to 
find the better way. 


Just call our nearest office for a free analysis 
and cost estimate to solve your problem. 


sic Established 1933 


Velstial 


TABULATING CORPORATION 
NATIONAL HEADQUARTERS: 


104 South Michigan Avenue * Chicago 3, Illinois 


OFFICES iN PRINCIPAL CITIES—COAST TO COAST 


‘\ 


Ss STATISTICAL MARK OF EXCELLENCE 


Please mention the Journal of the American Statistical Association in writing advertisers 








INDUSTRIAL 
STATISTICIAN 


The man we are seeking will have an M.S. or Ph.D. 
in Statistics or Mathematics and one to two years 
experience in engineering applications of statis- 
tics, reliability analysis or statistical quality con- 
trol. (New graduates with outstanding academic 
records will also be considered.) He will be given 
challenging assignments in applied statistics re- 
lated to research and development of nuclear 


weapons. 


This position is in our Livermore Laboratory located 
in sunny, smog-free Livermore, California—coun- 
try living just minutes from San Francisco. 


Similar openings at the M.S. and Ph.D. levels are 
available with our Albuquerque Laboratory. 


Sandia Corporation offers liberal employee bene- 
fits including vacation, retirement and insurance 
plans, an educational assistance program, and 
paid relocation allowance. U. S. Citizenship re- 
quired. All qualified applicants will receive consid- 
eration for employment without regard to race, 


creed, color, or national origin. 


Write to: Mr. V. G. Pappas, Division 501, 
Sandia Corporation, P.O. Box 969, Livermore, California 


SAN DIA 


CORPORATION 


LIVERMORE, CAL. 


Please mention the Journal of the Amenican Statistica AssociaTion in writing advertisers 














STATISTICIAN 


—Senior position to do statistical analysis and experimental! design. 


—Operational Research : Economic Forecast : Organoleptic Panels : 


Biological Assays. 


—tLarge diversified Food Company with excellent research facilities. 


—Digital computer available and instruction provided. 


Write, detailing training and experience to: 


Research Director 
Canada Packers Limited 
2200 St. Clair Avenue W. 


Toronto 9, Ontario, Canada 

















NOTEWORTHY STATISTICS TEXTS 


FROM MACMILLAN’S FORTHCOMING LIST 


Statistics for Psychological 
Research 
by William S. Ray, Professor of Psychology, 


The Woman's College of the University of 
North Carolina 


Designed for a one-semester course in psy- 
chological statistics, this intermediate text 
emphasizes the distinction between descrip- 
tive and inferential statistics, experimental 
and non-experimental research, and testing 
significance and estimating parameters. It 
develops the concept of psychological sta- 
tistics as an important part of the language 
of the research vehicle, not merely as a set 
of procedures for evaluating research data. 
Throughout, the text distinguishes between 
syntactical and semantic issues, stressing 
two major semantic problems—measurement 
and randomization of error. 

January, 1962 


Statistical Theory 


by B. W. Lindgren, Assistant Professor of 
Mathematics, Institute of Technology, Uni- 
versity of Minnesota 


This outstanding text for a one-year course 
in statistical theory or mathematical statis- 
tics includes much materiai that is either 
unique or unusually detailed. Sufficiency, 
minimal sufficiency, exponential families, 
Cramer-Rao inequality and efficiency, mono- 
tone likelihood ratio, uniformly most power- 
ful tests, decision theory, Bayes procedures, 
noncentral chi-square distributions, and 
Fisher-Yates and vanderWaerden compari- 
son tests are all thoroughly explored. The 
text opens with a development of the prob- 
ability theory needed for an understanding 
of the remainder of the course. Later chap- 
ters discuss statistical problems within the 
framework of decision theory, with the spe- 
cial language and topics of estimation and 
testing. January, 1962 


The Macmillan Company 


A Division of 
Zl 


The Crowell-Collier Publishing Compan 
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TWO IMPORTANT TITLES 
PREFACE TO ECONOMETRICS 


By Michael J, Brennan, Jr., Brown University 


This book provides an introduction to quantitative methods that can be under- 
stood by the undergraduate. Emphasis is placed on applications of economic 
theory, quantitative measurement, and tests of correctness of theory. An exten- 
sive group of problems is included at the end of each chapter. 


BUSINESS STATISTICS 


By John R. Stockton, The University of Texas 


In this book the student is given sufficient acquaintance with the most com- 
monly used methods of analysis to enable him to use statistical data effectively 
in making business decisions. It discusses the problems to which statistical 
analysis can be applied as well as the metheds of analysis. The mathematics 
needed by the student is explained as the problems of statistical analysis and 
formulas are presented. 





SOUTH-WESTERN PUBLISHING CO. 
(Specialists in Business and Economic Education) 
Cincinnati 27 New Rochelle, N.Y. Chicago 5 Burlingame, Calif. Dallas 2 











NEW IRWIN BOOKS FOR FALL CLASSES 


BUSINESS AND ECONOMIC STATISTICS, Revised Edition 


By WILLIAM A, SPURR, Stanford University, LESTER S. KELLOGG, Deere and 
Company, and JOHN H. SMITH, the American University 

Introduces the student systematically to statistical analysis as it is employed in all 
phases of business. 


BUSINESS AND ECONOMIC FORECASTING: An Econometric Approach 


By MILTON H. SPENCER, Wayne State University, COLIN CLARK, University of 
Oxford, and PETER W. HOGUET, The Econometric Institute, Inc. 


Emphasizes both the principles and techniques of forecasting by developing and 
integrating economic and statistical concepts. 


BASIC MATHEMATICS FOR BUSINESS ANALYSIS 
By 0. J. CURRY and JOHN E. PEARSON, both of North Texas State College 


Presents to the student an opportunity to gain an understanding of the mathematical 
techniques most commonly used in our business society. 


Write for Examination Copies 


RICHARD D. IRWIN, INC. ° HOMEWOOD, ILLINOIS 
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The Annals of Mathematical Statistics 


THE OFFICIAL JOURNAL OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS 





Vol. 32, No. 3 September, 1961 


Contents 


Oskar Anderson, 1887-1960 
On Fiducial Inference 
A Central Limit Theorem for Partly Dependent Variables 
.H. J. Goodwin and 8S. K. Zaremba 
Some Multivariate Chebyshev Inequalities with Extensions to Continuous Param- 
eter Processes Z. W. Birnbawm and Albert W. Marshall 
Maximal Independent Stochastic Processes . B. Bell 
On Markov Chain Potentials John G. Kemeny and J. Laurie Snell 
Markov Chains with Absorbing States: A Genetic Example G. A, Watterson 
Estimation of the Spectrum V. K. Murthy 
On a Coincidence Problem Concerning Particle Counters Lajos Takdcs 
On the Ruin Problem of Collective Risk Theory N. U. Prabhu 
The Random Walk between a Reflecting and an Absorbing Barrier ..B. Weesakul 
On the Queueing Process M/G/1 O. R. Heathcote 
The Sequential Design of Experiments for Infinitely Many States “. Nature . 
Arthur E. Albert 
Analysis of a Class of PBIB Designs with more than Two Associate Classes 
P. V. Rao 
On a Locally Most Powerful Boundary Randomized Similar Test for & inte 
pendence of Two Poisson Variables . Ahmed 


Confidence Intervals from Censored Samples po een 

Tests of Fit Based on the Number of Observations Falling in the Shortest Sample 
Spacings Determined by Earlier Observations Lionel Weiss 

Some Nonparametric Median Procedures V. P. Bhapkar 
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